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Chapter 1 
Introduction 



This text is an introduction to the study of NIP (or dependent) theories. It 
is meant to serve two purposes. The first is to present various aspects of NIP 
theories and give the reader the background material needed to understand 
almost any paper on the subject. The second is to advertise the use of hon- 
est definitions, in particular in establishing basic results, such as the so-called 
shrinking of indiscernibles. Thus although we claim no originality for the the- 
orems presented here, a few proofs are new, mainly in chapters 3 and 4. 

We have tried to give a horizontal exposition, covering different, sometimes 
unrelated, topics at the expense of exhaustivity. Thus no particular subject 
is dealt with in depth and only low-level results are included. The choices 
made reflect our own interests and are certainly very subjective. In particular, 
we say very little about algebraic structures and concentrate on combinatorial 
aspects. Overall, the style is concise, but hopefully all details of the proofs are 
given. A small number of facts are left to the reader as exercises, but only once 
or twice are they used later in the text. 

The material included is based on the work of a number of model theorists. 
Credits are usually not given alongside each theorem, but are recorded at the 
end of the chapter. 

We have included almost no preleminaries about model theory, thus we 
assume some familiarity with basic notions, in particular concerning compact- 
ness, types and indiscernibles sequences. Those prerequisites are exposed in 
various books such as that of Poizat [72], Marker [61] or Hodges [3D]. Famil- 
iarity with stability theory is not at all required. We will however make a few 
remarks concerning stable formulas or structures inside the text. The basic 
facts that we need are summarized in Appendix B. 
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History of the subject 

In his early works on classification theory, Shelah structured the landscape of 
first order theories by drawing dividing lines defined by the presence or absence 
of different combinatorial configurations. The most important one is that of 
stability. In fact, for some twenty years, pure model theory did not venture 
much outside of stable theories. Shelah discovered the independence property, 
when studying the possible behaviors for the function relating the size of a 
subset to the number of types over it. The class of theories lacking the inde- 
pendence property, or NIP theories, was studied very little in the beginnings, 
especially by Shelah and Poizat (see [721 Chapter 12] for an account of those 
early works). 

As the years passed, various structures were identified as being NIP: most 
notably henselian valued fields of characteristic with NIP residue field and 
ordered group (Delon [26]), the field Q p of p-adics ([H]) and ordered abelian 
groups ([36]). However, NIP theories were not studied per se. In [67], Pillay and 
Steinhorn, building on work of van den Dries, defined o-minimal theories as a 
framework for tame geometry. This has been a very active area of research ever 
since. Although it was noticed from the start that o-minimal theories lacked 
the independence property, very little use of this fact was made until recently. 
Nevertheless, o-minimal theories provide a wealth of interesting examples of 
NIP structures. 

In the years 2000, the interest in NIP theories was rekindled and the sub- 
ject has been expanding ever since. First Shelah initiated a systematic study 
which lead to a series of paper: jH2], [83], [77], [EH], [HI]. Amongst other things, 
he established the basic properties of forking, generalized a theorem of Baisa- 
lov and Poizat on externally definable sets, defined some subclasses, so called 
"strongly-dependent" and "strongly + -dependent" . This work culminates in 
[SI] with the proof that NIP theories haves few types up to automorphism 
(over saturated models). Parallel to this work, Hrushovski, Peterzil and Pillay 
developed the theory of measures (a notion introduced by Keisler in [55]) in 
order to solve Pillay's conjecture on definably compact groups in o-minimal 
theories. 

A third line of research starts with the work of Hrushovski, Haskell and 
Macpherson on algebraically closed valued fields (ACVF) and in particular 
on metastability ([38]). This lead to Hrushovski and Loeser giving a model 
theoretic construction of Berkovich spaces in rigid geometry as spaces of stably- 
dominated types, which made an explicit use of the NIP property along with 
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the work on metastability. 

Motivated by those results, a number of model theorists got interested 
in the subject and investigated NIP theories in various directions. We will 
present some in the course of this text and mention others at the end of this 
introduction. 

Let us end this general introduction by mentioning where NIP sits with 
respect to other classes of theories. First, all stable theories are NIP, as are 
o-minimal and C-minimal theories. Another well-studied extension of stability 
is that of simple theories (see [95]), however it is in a sense orthogonal to NIP: 
a theory is both simple and NIP if and only if it is stable. Simple and NIP 
theories both belong to the wider class of NTP2 theories (defined in Chapter 
5). 

Organization of this text 

Aside from the introduction and appendices, the text is divided into 6 chapters, 
each one focussing on a specific topic. In Chapter 2, we present the classical 
theory as it was established by Shelah and Poizat. We first work formula- by- 
formula giving some equivalent definitions of NIP. The original counting types 
characterization is included, although it will not be used later in the text. 
Starting at Section [2. 1.31 and throughout most of the text, we assume that our 
ambient theory T is NIP. That assumption will be dropped only for the first 
three sections of Chapter Of course, most of the results could be established 
for an NIP formula (or type) inside a (possibly) independent theory, but for 
the sake of clarity we will not work at this level of generality. The end of 
Chapter 2 is concerned with invariant and generically stable types. 

In Chapter 3 we define honest definitions. They serve as a substitute to 
definability of types in NIP theories. We use them to prove Shelah's theorem on 
expanding a model by externally definable sets and the very important results 
about shrinking of indiscernibles. 

Chapter 4 deals with dp-rank and strong dependence. In the literature, one 
can find up to three different definitions of dp-rank, based on how one handles 
the problem of almost finite, non-finite rank. None of them is perfect, but we 
have decided to use the same convention as in Adler's paper [2] on burden, 
although we refrain from duplicating limit cardinals into k_ and k. Instead, 
we define when dp-rk(p) < k, and it may happen that the dp-rank of a type is 
not defined (for example it can be < Ko but greater than all integers). 

In Chapter 5, we study forking and dividing. The main results are bdd(A)- 
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invariance of non-forking extension from Hrushovski and Pillay's |47j . and 
Chernikov and Kaplan's theorem about equality of forking and dividing over 
models. The right context for this latter result is NTP2 theories, but here again 
we assume NIP which slightly simplifies some proofs. 

The last two chapters have a different flavor. In Chapter 6, we change the 
framework to that of finite combinatorics. We are concerned with families of 
finite VC-dimension over finite sets. The finite and infinite approaches come 
together to prove uniformity of honest definitions. In Chapter 7, the two 
frameworks are combined with the introduction of Keisler measures. Here we 
stop short of giving applications to fsg groups and compact domination which 
can be found in [37] and [32] • 

Finally, three appendices are included. The first one gives some algebraic 
examples and in particular records some facts about valued fields. Most of the 
proofs are omitted, but we explain how to show that those structures are NIP. 
The other two appendices are very short and collect results about stability 
theory and probability theory for reference in the text. 

Further topics 

Many topics are entirely omitted from those notes. We mention some here 
with pointers to papers that the interested reader may turn to. We restrict 
ourselves to topics in pure model theory, and of course, we are by no means 
exhaustive. 

• Compact domination: We will define generically stable measure in Chap- 
ter 7. Further information concerning them can be found in [32] and [33] • It 
is proved in [37] that a definable compact group in an o-minimal theory is 
dominated in some sense by a compact group. The link with generically stable 
measures in made in [32] where another compact domination result concerning 
type spaces is stated. 

• Shelah's type decomposition theorems: Shelah showed in [85] that one 
can decompose types over saturated models in a finitely satisfiable component 
and a "directed" part. He then refined this greatly in [S3] to show that over 
saturated models, there are few types up to automorphism. 

• Directionality: In [53], Kaplan and Shelah group NIP theories into three 
classes called small, medium and large directionality depending on the number 
of coheirs that a type can have. 

• Extraction of indiscernibles: Shelah proves in [77] that in a strongly- 
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dependent theory, one can extract an infinite indiscernible sequence from any 
long enough sequence. In [M], similar statements are proved for theories of 
small and medium directionality. Counter-examples to the general case are 
given in [53] and [52] . 

• Computing VC-density: The problem of computing precisely the VC- 
density of formulas in various NIP theories is investigated in [5] and [9]. 

• Groups interpretable in NIP structures: Few general statements are 
known about groups interpretable in NIP structures. Kaplan and Shelah 
study various chain conditions. In another direction, pseudofinite NIP groups 
are studied in [60] and omega-categorical groups in [56] . 

It is proved in [50] that NIP fields have no Artin-Schreier extensions. 

• Notions of minimality: We will define in 14.251 the class of dp-minimal 
theories. It provides us with a general combinatorial framework to study o- 
minimal and C-minimal theories, amongst others. Dp-minimal theories are 
studied for example in [HI], [28] and [HE]. A more restrictive class is that of 
VC-minimal theories which is defined by Adler in [3] and investigated in [34] 
and [25] . 

• Dependent pairs: A pair is a structure M with an additional predicate 
P(x) naming a given subset. Various pairs of NIP structures have been studied: 
[9"2] . [15] . [T6~| . [35] . [T7] . The paper [23] contains a general criterion to ensure 
that the pair is still NIP. 

• Distal theory: A distal theory is an NIP theory which is in some sense 
"purely unstable". This notion is defined and studied in [86] . The paper [22] 
also contains some facts about it. 

Acknowledgments The material presented here was exposed during a 
series of lectures in Paris. I would like to thank all the participants of this 
seminar: Elisabeth Bouscaren, Zoe Chatzidakis, Pablo Cubides Kovacsics, 
Frangoise Delon, Martin Hils, Samaria Montenegro, Frangoise Point, Silvain 
Rideau, Patrick Simonetta. Special thanks to Martin Hils for a number of 
helpful comments on this text. 

1.1 Preliminaries 

We work with a complete, one-sorted, theory T in a language L. We have a 
monster model U which is K-saturated and homogeneous. A subset A C U is 
small if it is of size less than R. 
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We do not distinguish between points and tuples. If a is a tuple of size \a\, 
we will usually write a E A to mean a E A' a L Similarly, used to 

denote tuples of variables. 

If A C U is any set, and 0(x) is a formula, then 0(A) = {a E A^ : hi |= 
0(a)}. The set of types over A in the variable x is denoted by S^A). We will 
often drop the x. If p E S X (A), we might write p x or p(x) to emphasize that p 
is a type in the variable x. We say that a type p concentrates on a definable 
set 0(x) if p h 0(x). If A C B and p is a type over £?, we denote by p \ A, or 
by p\a the restriction of p to A. 

We use the notation 0° to mean -i0 and 1 to mean 0. If 0(x; y) is a 
formula, a 0-type over A is a maximal consistent set of formulas of the form 
0(x; a) e , for a E A and e G {0, 1}. The set of 0-types over A is denoted by 
S+{A). 

A global type, is a type over hi. 
1.1.1 Indiscernible sequences 

We will typically denote sequences of tuples by / = (cij : i E X) where X is 
some linearly ordered set. The order on X will be denoted by <x or simply < 
if no confusion arises. If / = (dj : i G X) and J = (6j : j E 3\ then we write 
the concatenation of / and J as / + J. It has / as initial segment and J as the 
complementary final segment. We use the notation (a) to denote the sequence 
which has a as unique element. 

We say that the sequence / is endless if the indexing order X has no last 
element. 

Let A be a finite set of formulas and A a set of parameters. A (possibly 
finite) sequence I = (a^ : i G X) is A-indiscernible over A, if for every integer 
k and two increasing tuples z'i <% • • ■ <x ik and ji <x • • ■ <x jk, b E A and 
formula 0(xi, . . . ,x k ;y) E A, we have 0(a il5 . . . , a ik ; b) <H- 0(0^, . . . ,aj k ;b). 

An indiscernible sequence is an infinite sequence which is A-indiscernible 
for all A. If the sequence I is indiscernible over A, then we define the EM- 
type of / over A to be the sequence (p n : 1 < n < u) of types where 
p n = tp(a« 1 , . . . , di n /A) for some (any) i\ <x . . . <x in- If A = 0, then we can 
omit it. We will write / =a AI J to mean that I and J are two A-indiscernible 
sequences having the same E'M-type over A. It follows from compactness that 
if I is an indiscernible sequence and J is any small infinite linear order, than 
we can find an indiscernible sequence J indexed by J and of same EM-type 
as I. 
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A sequence I is totally indiscernible (or set indiscernible) if every per- 
mutation of it is indiscernible. 

If (bj : j < a) is any sequence and I = (a^ : % G X) is an indiscernible 
sequence, we say that / is based on the sequence ib^) i<a if for every %\ < x 
■ ■ ■ <x i n and formula <p(x±, . . . , x n ) G L, there are j\ < . . . < j n < a such that 
0(aj i; . . . , a>i n ) 0(6^, . . . , bj n ) holds. We say based on {bi)i <a over A if we 
allow <f>(xx, . . . , x n ) to range over L(A). 

Ramsey's theorem and compactness imply that if (6j)j< Q is any sequence, 
then there is an indiscernible sequence based on it. 
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Chapter 2 

Definitions and basic properties 



In this chapter, we introduce the basic objects of our study. We first define the 
notion of an NIP formula. The combinatorial definition is not very handy, and 
we give an equivalent characterization involving indiscernible sequences which 
is the one we will most often use. We then define NIP theories as theories in 
which all formulas are NIP. From that point on, we restrict our attention to 
that class of theories. We study invariant types and Morley sequences. Finally, 
we define the notion of a generically stable type and give some properties. 

To illustrate the notions considered, we prove some results on definable 
groups in NIP theories: the Baldwin-Saxl theorem, and Shelah's theorem on 
existence of infinite definable abelian subgroups. 

2.1 NIP formulas 

Definition 2.1. A partitioned formula <p(x;y) is NIP if we cannot find an 
infinite set A of |x|-tuples such that: 

(IP)^ for all A C A, there is some bA of size \y\ such that 

(j){A- b Ao )=A . 

If a formula is not NIP, we say that it has IP. 

Remark 2.2. If 4>{x] y) is NIP, then by compactness, there is some N such that 
(IP)^ does not hold for any A of size at least N . 

The maximal integer N for which there is some A of size N such that 
(IP)^ holds is called the VC-dimension of <fi. If there is no such integer {i.e., 
the formula has IP), then we say that the VC-dimension is infinite. 
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Example 2.3. 

• Let T = DLO be the theory of dense linear orders with no endpoints. 
Then the formula 4>(x;y) = x < y is NIP of VC-dimension 1. Indeed, if we 
have <2i < a2, then we cannot find some 6{ a2 } such that 

U |= -i0(ai; b {a2} ) A <p(a 2 ] b {a2} ). 

• If <f)(x; y) is a stable formula, then it is NIP. 

• If T is the theory of arithmetic, then the formula <j)(x; y) = "x divides y" 
has IP. To see this, take any JVgN and A = {p , . . . .p^-i} a set of distinct 
prime numbers. Then for any A C A, set b^ to be I"I a€ _A a - We have 
|= <f)(a, 6^0 ) < ^ === ^ a £ Aq. This shows that <fr(x; y) has infinite VC-dimension. 

• If T is the random graph in the language L = {R}, then the formula 
y) = xRy has IP. 

• If T is a theory of an infinite boolean algebra, in the language {<, U, Pi}, 
then the formula x < y has IP. 

Lemma 2.4. The formula 4>(x; y) is NIP if and only if (j) opp (x; y) = 4>(y; x) is 
NIP. 

Proof. Assume that y) has IP. Then by compactness, we can find some 
A = {ai : i G such that (IP)^ holds, witnessed by tuples bA , A C A. 

Let B = {bj : j G u} where bj := b^ and Aj := {D C w : j G -D}. Then for 
any B Q G B , we have 

H </>(aB a ,bj) <^ j G .Bq. 

This shows that (IP)^opp,b holds. Therefore <p opp has IP. Note however that 
cj) opp need not have the same VC-dimension as 0. □ 

We now give an equivalent characterization of NIP which is the one we will 
most often use. 

Lemma 2.5. The formula 4>{x\ y) has IP if and only if there is an indiscernible 
sequence (a, : i < oj) and a tuple b such that 

\= 4>{cii] b) <^=^> i is even. 

Proof. (<=): Assume that there is a sequence (a^ : i < u>) and a tuple b as 
above. Let A C uj. We show that there is some 6a such that <f)(ai] b^) holds 
if and only if z G A. We can find an increasing one-to-one map r : u — > uj 
such that for all i £ uj, r(i) is even if and only if i is in A. Then the map 
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sending b{ to b T u), for alH < u is a partial isomorphism. It extends to a global 
automorphism a. Then take a a = a -1 (a). 

(=>-): Assume that 4>(x; y) has IP. Let A be an infinite set of |x|-tuples such 
that (IP)^a holds. By compactness we may assume that A is very large and 
then by Erdos-Rado, we can find some indiscernible sequence I = : % < to) 
of |x|-tuples such that for any n-tuple c = (c ii; . . . ,c in ) from I there is a = 
(ai, . . . , a re ) from A having the same type as c. This implies that (IP),/,,/ holds. 
In particular, there is b such that 0(cj; b) holds if and only if i is even. □ 

It follows by compactness that if 4>(x; y) is NIP, then there is a finite set A 
of formulas and an integer N such that the following does not exist: 

- (at : i < N) a A-indiscernible sequence of |x| -tuples; 

- b a |y|-tuple, such that -i(0(aj; b) -H- </>(a i+ i; b)) holds for i < N — 1. 

Let / = (aj : z G X) be an indiscernible sequence, and let 0(x; 6) e -^(W) be 
a formula. Then there is a maximal integer n such that we can find io <x ■ ■ ■ <x 
i n with -i(0(a ifc ; b) <-> 0(a ifc+1 ; 6)) for all k < n. We call such an n the number 
of alternations of 6) on the sequence I and write it as alt(0(x; b), I). We 
let alt(0(x; y)) denote the maximum value of alt(0(x; b),I), for 6 ranging in U 
and / ranging over all indiscernible sequences. Note that this maximum exists 
and is bounded by the number N of the previous paragraph. 

Proposition 2.6. The formula 4>(x;y) is NIP if and only if for any indis- 
cernible sequence (aj : i G X) and tuple b, there is some end segment X C X 
and 7] G {0, 1} such that 0(aj; 6) r ' holds for any i G X . 

Proof. If X has a last elements io, let X = {io}- Otherwise, this follows im- 
mediately from Lemma [2.41 □ 

Lemma 2.7. A boolean combination of NIP formulas is NIP. 

Proof. It is clear from the definition that the negation of an NIP formula is 
NIP. 

Let <f)(x;y) and ip(x;y) be two NIP formulas and we want to show that 
9(x;y) = <fr(x;y) A ip(x;y) is NIP. We use the criterion from Proposition 12.61 
Let (dj : i £ X) be an indiscernible sequence of |x|-tuples and let b be a \y\- 
tuple. Let C X be an end segment such that 0(a«; b) <H- 0(%; b) holds for 
i,j G X^,. Define X^, similarly. Then let X =1^(11^. Then X is an end 
segment and we have 8(af,b) -H- 6(aj]b) for i,j G X . This shows that 9(x;y) 
is NIP. □ 
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2.1.1 The strict order property 

Definition 2.8. We say that a formula <fr(x;y) has the strict order property 
(SOP) if there exists a sequence (6j : i < u) of |y|-tuples such that for all i < cu, 

Remark 2.9. For the theory T, it is equivalent to say that some formula has 
SOP, or to say that there is, in T eq , a partial definable order with infinite 
chains. (If 4>(x;y) has SOP, then the formula ^(yx,^) = (Vx)</>(x; yi) — > 
4>(x; 1/2) defines an order with infinite chains.) 

Theorem 2.10. Assume that T is unstable. Then at least one of the following 
holds: 

- there is a formula <f>(x; y) which has IP; 

- there is a formula <f>(x; y) which has SOP. 

Proof. Let <p(x; y) be unstable and NIP. There is some indiscernible sequence 
I — (aj : % < to) such that for every N < u, we can find b^ such that 4>(af, b) 
holds if and only if % < N. By NIP, there is some integer n and r] : n — >■ {0, 1} 
such that 

A y) v{l) 

i<n 

is inconsistent. Starting with that formula, we change one by one instances of 
-'4>(a i ; y) A(f>(a i+1 ; y) to 0(aj; y) f\—>4>( a i+i] y). In the end, we arrive at a formula 
of the form f\ i<N <fi(ai] y) A/\ 7V<i<n 0(a«; y). The tuple b]y satisfies that formula. 
There is therefore one step in the process in which we change an inconsistent 
formula to a consistent one. Namely, there is some io < n, ry : n — )• {0, 1} such 
that 

f\ 0(a t ; y)"« A ^<j>(a io] y) A 0(a, o+1 ; y) A /\ 0(a,; y)"« 

i<io io+l<i<n 

is inconsistent, but 

f\ 0(a,; y)"« A 0(a lo ; y) A -0(a io+1 ; y) A /\ 0(a i5 y)"« 

i<io £o+l<i<n 

is consistent. Write those formulas as 9(a;y) A -^(f>(a io ; y) A (f)(a io+ i; y) and 
0(a; y) A <j>(a io ; y) A -x/>(a io+ i; y) respectively. 

Increase the sequence / to I' — (a* : i £ Q), indexed by Q. Then, for 
^0 < £j i' < &o + 1) we have: (By) 6 (a; y) A 4>( a i\ y) A -i0(aj/, y) holds if and only 
if z < z'. It follows that the formula ifj(y;x) = 6 (a; y) A 0(x; y) has the strict 
order property. □ 
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2.1.2 Counting types 

We give another characterization of NIP by counting types. This characteriza- 
tion however relies on set-theoretic assumptions, and will not be used later in 
this text. 

Fix some formula (f)(x;y). The stability function for is the function 
defined on cardinals by g^(n) = sup{|S , 0(A)| : A of size k}. Recall that S^(A) 
denotes the set of 0-types over A. 

If 4>(x;y) is stable, than there is some integer n such that g^{n) < n n for 
every (finite or infinite) cardinal k. Conversely, if g^K) = k for some infinite 
cardinal k, then <fi is stable. (See Appendix iBl ) 

If (f)(x;y) has IP, then easily, we have g^n) = 2 K for every cardinal k. 
Conversely, it follows from the definition that if g^n) = 2 K for every finite k, 
then has IP. 

The case of an unstable NIP formula 4>(x; y) is trickier. We will see in 
Chapter [6j that there is n such that for any finite cardinal k, we have g$(n) < 
n n . If k is infinite and 2 K = k + , then as is unstable, we must have g^{n) = 2 K 
so there is no hope of separating IP from NIP by counting types over infinite 
sets without extra set-theoretic assumptions. 

Definition 2.11. For a cardinal A, we define ded(A) = sup{ft : there is a linear 
order of size k which has a dense subset of size A}. 

Proposition 2.12. Let <p(x; y) be a formula. Assume that there is some infinite 
set A with IS^A)] > ded(|A|) ; then 4>(x;y) has IP. 

Proof. Assume that |jS^(A)| > ded(|A|) and that A is chosen such that \i = \A\ 
is minimal. Let A = ded(|v4|) + . Enumerate A as {ai : i < ji} and for i < /i, set 
A{ = {aj : j < i}. For each % < /i, we then have |S^(Aj)| < dedd^LJ) < A. 
Define the following sets: 

Si = {p G S^(Ai) : p has > A extensions to S^A)}, for i < 
S^ = {p e S^A) : p \ Ai G St for all i < //}. 

Note that for every i < fi, as |5^(Aj)| < A, we have {S^Ai) \ Si\ < A and 
as each type from that set has less than A extensions to a type over A, the 
cardinality of {p G S^(A) : p \ Ai £ Si} is less that A. Summing over i < fi, we 
see that 

|£V(A)\^|<A. 
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It follows that every type in Si has at least A extensions to a type in S M . 

Let S <fl = Uj<At ^ anc ^ = *^<m ^ *-V We define a linear order on S< M 
in the following way For p, q G S< M , if p C g (resp. q C p), we set p < g (resp. 
q < p). Otherwise, let z < /i be maximal such that p \ Si = q \ Si. Then set 

p < q if p I cti) (which implies g h 0(x;aj)) and p > q otherwise. We 

leave it to the reader to check that this indeed defines a linear order with S <fl 
as a dense subset. 

We now show: 

(*) For all n < uj and q G S< M , there are tuples b q , . . . , G A such that 
for every r/ : n — )■ {0, 1}, the type q(x) A f\ k<n 4>( x l b^) 11 ^ is consistent. 

This will imply that <fr(x;y) has the independence property. We show (*) 
by induction on n. For n = 0, there is nothing to prove. 

Assume the result is known for n, and we prove it for n + 1. Let q G Si 
for some i < fi. Define = {p £ S< M : p \ Si = q}. Using the order defined 
above, we see that S q PI S <A1 is a dense subset of S q . We know that \S q \ > A, 
therefore by definition of ded, \S q PI S< M | > /i. It follows that there is some 
i < fi such that \S q fl Sj| > /i. The induction hypothesis gives, for every type 
p G S q C\Si, a tuple (bp, . . . , frp -1 ). We can find two distinct types pi,^2 £ S g nSj 
for which the corresponding tuples are the same, equal to some (6 , • • • , b n -\). 

Let 6 n G Aj be such that pi h 6 n ) and P2 I b n ) (exchanging the roles 

of pi and P2 if necessary). Then for every rj : n + 1 — > {0, 1}, the partial type 
q(x) A [\k <n (f>{x] bk) v ^ is consistent. This finishes the induction step, and the 
proof. □ 

For any infinite cardinal k, we have k < ded(n) < 1 R . Hence if 1 K = k + , 
then ded(ft) = 2 K and the hypothesis of the previous proposition cannot be 
satisfied. However, Mitchell has shown in [63] that if cf(n) > K , then there is 
a cardinal-preserving forcing extension of the set-theoretical universe on which 
ded(/t) < 2 K . In a such an extension, a formula <fr(x; y) is IP if and only if 
9M = 2 K - 

2.1.3 NIP theories 

Definition 2.13. The theory T is NIP if all formulas <p(x\ y) are NIP. 

Proposition 2.14. Assume that all formulas 4>(x;y), where \x\ = 1 are NIP, 
then T is NIP. 

Proof. Assume that all formulas <f)(x; y) where \x\ = 1 are NIP. 
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Claim : Let (foj : i < \T\ + ) be an indiscernible sequence, and let a G U, 
\a\ = 1. Then there is some a < |T| + such that the sequence (pi : a < i < \T\ + ) 
is indiscernible over a. 

Proof: Otherwise, there is some formula 8(x; y±, . . . , y^) such that for all 
a < |T| + , we can find some 

a < i\ < . . . < ik < \T\ + and a < j\ < . . . < jk < \T\ + 

such that S(a,bi i , . . . , b ik ) A-><5(a, b^, . . . , bi k ) holds. It follows that we can find a 
sequence / = . . . "i l k : / < cu) such that i[ < . . . i l k < i 1 ^ 1 for all Z < u and we 
have 5(a, b^, . . . , bji) if and only if / is even. As the sequence / is indiscernible, 
this contradicts NIP. 

Now let 4>(x;y) be any formula, with x = (x\, . . . ,x n ). Let (pi : i < \T\ + ) 
be an indiscernible sequence and let a = (ai, . . . ,a n ) be any tuple. By the 
claim, there is some «i < |T| + such that the sequence (pi : a\ < i < \T\ + ) is 
indiscernible over a\. This implies that the sequence (bfai : a\ < i < \T\ + ) is 
indiscernible. Therefore another application of the claim gives some «2 < \T\ + 
such that the sequence (bfai : «2 < i < \T\ + ) is indiscernible over ci2- Iterating, 
we find a n < \T\ + such that (pi"ai . . . a n : a n < i < \T\ + ) is indiscernible. This 
implies that the truth value of 0(a; bi) is constant for i > a n . Therefore the 
formula <fr(x; y) is NIP. □ 

Example 2.15. The following theories are NIP: 

- any stable theory; 

- any o-minimal theory; 

- any C-minimal theory, for example ACVF, the theory of algebraically 
closed valued fields; 

- Th(Q p ); 

- any pure tree. 

See Appendix |A] for the definitions of some of those theories and extra 
details. 

We give an example of how the definition of the independence property can 
be used. 

Theorem 2.16 (Baldwin-Saxl) . Let G be a group definable in an NIP theory 
T. Let H a be a uniformly definable family of subgroups of G. Then there is 
an integer N such that for any finite intersection HaeA there is a subset 
A C A of size N with f) a£A H a = f) aGAa H a . 
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Proof. The fact that H a is a uniformly definable family of subgroups means 
that there is a formula <p(x; y) such that for any a, <f>{x\ a) defines a subgroup 
of G. Take any integer N, and assume that the conclusion of the theorem does 
not hold for N. Then we can find some set A = {do, . . . , ajv} of parameters 
such that for every k < N, we have flae^YK} H a ^ V\ aeA H a- Let K k = 
C\aeA\{a k } H a and K = C\aeA H a- For ever y k < N, pick a point c k E K k \K. 
For BciV + 1, define c# = IlfeeB Cfc ' w here the product is in the sense of the 
group G. Then we have 

c B eH k k(£ B. 

This shows that the formula ip(y; x) := <p(x; y) has VC-dimension at least N. 
Therefore there is a maximal such N. □ 



2.2 Invariant types 

Assumption: From now on, we assume that the theory T is NIP (except in 
Proposition 12.311 where we give an additional characterization of NIP). 

Definition 2.17 (Invariant type). Let A <zU and p G S X (U). We say that p 
is A-invariant if ap = p for any a 6 Aut{U/A). 

We say that p is invariant, if it is A- invariant for some small A <zlA. 

Another way to phrase the definition is to say that p is A-invariant if for 
every formula (f)(x; y) and tuples b, b' G U, if b =a b', then 

p h (j)(x; b) <^=^> p h (j)(x; b'). 

Example 2.18. A global type p G S X (U) is said to be definable over A if for 
every formula 4>(x; y) without parameters, there is some formula d<p(y) G L(A) 
such that p h 0(x; 6) <^=^> 6 |= for all b G W. Any A-definable type 

is clearly A-invariant. We will say that a global type p is definable if it is 
A- definable for some A. 

Example 2.19. A type p is said to be finitely satisfiable in a set A if for every 
formula 4>(x;b) G p, there is a G A such that 4>(a;b) holds. If p G S X {U) 
is finitely satisfiable in A, then it is A-invariant. A global type p is finitely 
satisfiable if it is finitely satisfiable in A for some small A. We present two 
constructions to obtain such types. 
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(1) Let A C ti be any set and let T> be an ultrafilter on A^. We define 
Pv G by: 

W I- 6) 6) g D, 

for every formula 6) G L{U). Every type finitely satisfiable in A is equal to 
Pt> for some ultrafilter T>. In particular note that if we take T> to be a principal 
ultrafilter, then we obtain a realized type. 

(2) Let / = (aj : i G X) be an indiscernible sequence. Then by Proposition 
12.61 the sequence (tp(aj/W) : i G I) converges in S'(W) to some type, called the 
limit type of the sequence /, denoted by lim(7). This type is finitely satisfiable 
in /, and indeed in any cofinal subsequence of /. 

If po G S X (M) is a type, then a coheir of p^ is a global extension p which is 
finitely satisfiable in M. Such a coheir always exists as can be seen by extending 
{0(M; a) : (p(x; a) G po} into an ultrafilter T> on M' x ' and using considering 
Pv- 

Remark 2.20. Let p G S^W) be an A-invariant type. Let M D A be 
saturated and define q = p\ M . Then q has a unique extension to a type over 
W which is A-invariant. (Let p be such an extension, and consider a formula 
4>(x;b) G L(U), then there is 6' G M such that b =a b'. By A-invariance, we 
have p h 6) <^=^ p h <^(i; &') ^=^> p h 6) <^=^ p\- <p(x\b').) 

In particular, if p G 5"(W) is an invariant type, and if V D W, then there 
is a unique extension of p to a type over V which is invariant over some small 
A dlA. We will denote that extension by p\V. 

Lemma 2.21. Let p G S{U) be an A-invariant type, then: 

- if p is definable, it is A-definable; 

- if p is finitely satisfiable, it is finitely satisfiable in any model M D A. 

Proof. Assume that p is definable. Let (j)(x; y) be any formula and let d<f)(y) G 
L{U) be such that for any b G U, p h <p(x;b) <^=^ b \= d<ft(y). Then the 
definable set d<fi(y) is invariant under all automorphisms that fix A pointwise. 
It follows that it is A-definable. 

Assume now that p is finitely satisfiable in some small model N. Let M be 
any small model containing A. Let 4>(x; d) G L{U) be any formula and let N\ 
realize a coheir of tp(7V/M) over Md. Then by invariance, p is finitely satisfi- 
able in N\ and in particular (j)(Ni; d) is non-empty. By the coheir hypothesis, 
0(M; d) is non-empty. Therefore p is finitely satisfiable in M. □ 
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2.2.1 Products and Morley sequences 

Let p x , q y G S(U) be two invariant types. We define the type p x (g) q y G S xy (lA) 
as tp(a, b/U) where b \= q y and a \= p\Ub. 

Another way to say the same thing, if p and q are A-invariant: p x (g) q y h 
if p x h </>(x; 6) for some (any) b EU with b \= q y \A- 

The relation <g> is associative, but in general not commutative. 

More generally, if p x is M-invariant, N D M and q y G S'(iV) is any type, 
then we can define p x ®q y G S xy (N) as tp(a, b/N) where b \= q y and a \= p \ Nb. 

Example 2.22. Let T be DLO, and take p = q to be the type at +oo. Then 
Px ® q y \~ x > y whereas q y (g) p x h x < 

Lemma 2.23. //px is a global definable type and q y is finitely satisfiable, then 
p x ®q y = q y ®p x . 

Proof. Let M -< W be a model such that p is definable over M and q is finitely 
satisfiable in M. Let (p(x,y; d) G L{U). By definability of p, there is a formula 
d(f>(y; z) G L(M) such that for every 6, <f G W, we have p h <f)(x, b; d') -<=>- b |= 
c/0(x; <i'). Let r\ G {0, 1} such that q h c?0(x; d') 1 *. 

Without loss, assume that g y tg> p^ h <p(x,y;d). Let a |= p. Then g y |Wa h 
0(a, rf) and as q is finitely satisfiable, there is b G M such that 6 |= </>(a, y; d) A 
d)^. Then p h 6; d) A (i</>(6; (i)' 7 and by definition of dcj), this implies 
that r) — 1. 

Therefore g h <i(/>(?/; d) and it follows that p x ® g y h </>(x, d). □ 
If Pa is an A- invariant type, we define by induction for n G N*: 

Pi 1 } = Px and p^+% n = p Xn <g> p£!...,xn-! • 

Let also pio^i,--- 

= U^ (n) - For any JV D A, a realization ( Gi : i < w) of pH^ 
is called a Morley sequence of p over TV (indexed by u). Such a sequence 
(di : i < a;) is indiscernible over iV (using associativity, we can check by 
induction on n that for any ij, . . . , i n G w, we have tp(a il , . . . , a in /N) = p^|j\r)- 
More generally, any sequence indiscernible over iV whose EM-type over iV 
is given by (p^|jv : 1 < "ft < w) is called a Morley sequence of p over A 7 ". 

Exercise 2.24. If p is a global A-definable type (resp. finitely satisfiable in 
A), then p^ is also A-definable (resp. finitely satisfiable in A). 
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Let I = (a t : t G X) be an endless indiscernible sequence. We have defined 
the limit type p = lim(7) as a global invariant type. Let J be any linear order 
and (ct : t G J) be a sequence such that c t \= p \ I + c >t . So J is a Morley 
sequence of p over I indexed backwards. Then I + J is indiscernible. 

An application to definable groups 

Lemma 2.25. Let G be a definable group. Let p be a non-realized invariant 
type concentrating on G and (a, : i < u) a Morley sequence of p. Assume 
that for every i,j < u, a^aj = a^, then there is an infinite abelian definable 
subgroup H of G, such that p concentrates on H . 

Proof. First, we note that it is enough to find some infinite definable set D C G 
such that p concentrates on D and any two elements of D commute. Indeed, 
if we have such a set D, we let H = C(C(D)), where C(X) = {y G G : yx = 
xy for all x G X}. Then if is a definable abelian subgroup containing D. 

We try to build by induction a sequence (a n b n : n < u) such that a n \= p \ 
a<nb< n , b n \= p \ a <n b <n and a n b n ^ b n a n . Assume that we succeed. Then by 
hypothesis, a n b m = b m a n for n ^ m. For any I C u finite, define bj = Ylnei ^n- 
We have thus, a n bj = bja n if and only if n (jt I. This contradicts NIP. 

Therefore the construction must stop and for some n, we cannot construct 
a n , b n as required. Then if we let q = p \ a <n b <n , we have q(x) A q(y) — > 
xy = yx. By compactness, we can replace q by some finite part go C?. As p 
is a non- realized type, the definable set qo(x) is infinite and has the required 
property. □ 

Proposition 2.26. Let G be a definable group, and assume that there is an 
infinite subset A C G such that any two elements of A commute, than there is 
an infinite abelian definable subgroup of G. 

Proof. Let T> be an ultrafilter on A and consider the average type px> of that 
ultrafilter. Then px> satisfies the conditions of the previous lemma. □ 

Note that if G is torsion free, then the proposition is easy (and does not 
require NIP): take H = C(C(a)) for any a G G \ {1}. 

2.2.2 Eventual types 

In a stable theory, an invariant type p is the average of any one of its Morley 
sequences. This is no longer true in general for NIP theories. We show now 
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that we can nevertheless recover an A-invariant type from its Morley sequence 
over A. 

Proposition 2.27. Letp,q G S X (U) be A-invariant types. If p^'\a — Q \a> 
then p = q. 

Proof. By assumption, if J is a Morley sequence of p over A (indexed by any 
linear order) then it is also a Morley sequence of q over A. Let <fr(x; d) G L{U) 
be any formula and let / = (a, : i G I) be a Morley sequence of p over A such 
that alt(0(x; d), I) is maximal. Let rj G {0,1} be such that lim(7) h (^{x^d) 71 . 
Now let a |= p\id- Then the sequence / + (a) is also a Morley sequence of p 
over A. By maximality of 7, we must have alt(0(x; d), I) = alt(0(x; d), 1+ (a)). 
This implies that a \= 4>(x; d) v . Therefore p h <p(x; d) v and as the roles of p 
and q are interchangeable, we also have q h rf)^. Thus p = q. □ 

Definition 2.28. Let A dlA and let I be an ^-indiscernible sequence. We say 
that I is based on A if for all A-indiscernible sequences I±, I2 of same EM- type 
as I over A, there is some tuple a such that I\ + (a) and I2 + (a) are both 
indiscernible over A. 

Let / be any indiscernible sequence based on A. Let <p(x; d) G L(IA) be any 
formula. Let I\ =% I an d -^2 =a I such that alt(0(x; d), Jj) is maximal, for 
% G {1, 2}. Let r/j, for z = 1, 2 be such that lim(/j) h <p(x; d) Vi . As J is based on 
A, there is some tuple a such that both I\ + (a) and I2 + (a) are indiscernible 
over A. By maximality of I±, we have alt(0(x; d), I± + (a)) = alt(0(x; d), I\) 
and a |= <f)(x; d) m . Similarly we have a \= (f>(x; d) m therefore 771 = 772. 

We now define a global type Ev(I/A), called the eventual type of the se- 
quence I over A. For this, let <p(x;d) G L{U). Pick some J =a I such 
that alt(0(x; d), J) is maximal, and set Ev(I/A) h <p(x;d) if and only if 
lim(J) h <ft(x;d). By the previous argument, this does not depend on the 
choice of J. It is clear that this defines a consistent, complete type over IA. 
Also, Ev(I/A) only depends on the type of I over A (indeed of its EM-type) 
therefore Ev(I/A) is an A-invariant type. 

Another characterization of Ev(I/A) (which explains its name) is the fol- 
lowing: for any formula 0(x; d) G L(U), we have Ev(I/A) h <p(x; d) if and only 
if for every sequence J =a M one can increase J to some A-indiscernible 
sequence J' with lim(J') h <p(x; d). 

Example 2.29. Let T be DLO, and consider an increasing sequence 7 = (aj : 
i < uj). Let 62 < b\ be two points greater than all the a^s. Set A\ = {b\} and 
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A2 = {b\, 62}, then I is indiscernible both over A\ and A?,. The type Ev{I/A{) 
is the definable type "6^" whereas Ev{I /A2) is the type "63 "■ This shows that 
changing the parameter set A may modify the eventual type Ev(I/A). 

Let I be an indiscernible sequence based on A. We show that / is a Morley 
sequence of Ev(I/A). Without loss, we may assume that I — (<2j : % < uS). It is 
enough to show that for each n < 00, we have a n \= Ev(I/A) \ Aa <n . We know 
that we can increase I to an A-indiscernible sequence J such that lim(J) | 
Aa <n = Ev(I/A) \ Aa <n . By indiscernability of J, a n \= lim(J) \ Aa <n which 
gives what we want. 

Proposition 2.30. There is a bijection between EM-classes over A of indis- 
cernible sequences based on A and A-invariant types. 

Proof. From left to right, we map an indiscernible sequence / to Ev(I/A) which 
only depends on the EM-type of / over A. In the other direction, we map an 
A-invariant type p to the EM-class of its Morley sequence. □ 

Proposition 2.31. The theory T is NIP if and only if for all M \= T , for all 

p G S(M), p has at most 2' M ' + ' T ' global coheirs. 

Proof. We show left to right. Assume that T is NIP and let M (= T, then 
by Proposition 12.271 an M-invariant type is determined by p^\m- There are 
at most 2' M ' + ' T ' values for that type, so there are at most 2' M ' + ' r ' global M- 
invariant types. The result follows. 

Conversely, assume that T has IP. Then there is a set {a, : i < A} of finite 
tuples, for A = |T| + , and a formula 4>(x;y) such that for any A C A, we can 
find some bA such that: 0(aj; 6a) i G A Let M be a model of size A 

containing all the a^'s. For D an ultrafilter over A, we define p-p as the average 
of tp(aj/W) along V. Then we have px> l~ <j>(x; 6^) < ^ = ^ > A e V. This shows 
that for T> 7^ T>', the two types pp and px>' are distinct. Furthermore, px> is 
finitely satisfiable in M. As there are 2 2 pairwise distinct ultrafilters on A, we 
have obtained 2 2 global types finitely satisfiable in M. Since there are only 
2 A types of the same arity over M, there is at least one such type which has 
2 2 global coheirs. □ 

2.2.3 Generically stable types 

Lemma 2.32. Let I = (a^ : i G X) be a totally indiscernible sequence, and 
4>(x;b) G L{U) a formula (\x\ = \a,i\) } then the set {i G X :|= 0(aj; b)} is finite 
or cofinite in X. 
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Proof. Otherwise, we can build a sequence (z& : k < oj) of pairwise distinct 
members of X such that i k G X if and only if k is even. Then the sequence 
(a,i k : k < u) is indiscernible and the formula 4>{cii k ; b) holds if and only if k is 
even. This contradicts NIP. □ 

Note that it follows from the proof that the cardinality of {i G X :|= 0(aj; b)} 
or of its complement is bounded by alt(0(x; y))/2. 

Theorem 2.33. Let p be a global A-invariant type. Then the following are 
equivalent: 

(i) if I \=p^\a, then p = lim(7); 

(ii) p is definable and finitely satisfiable; 
(Hi) p x ®Py=p y <8>p x ; 

(iv) any Morley sequence of p is totally indiscernible. 

Proof, (i) =>- (ii): Assume (i). Then p is finitely satisfiable in I. Also, let 
4>(x; y) G L and let b G U. Then p h b) if and only if there is some 
sequence (a k : k < u) \= p^\a such that (f)(a k ;b) holds for all k. Therefore 
{b G U : p h is a type-definable set. The same holds for ->0 instead 

of <p, so the complement of that set is also type- definable. It follows that it is 
definable. Therefore p is a definable type. 

(ii) =>• (iii): Follows from Lemma [2.231 

(iii) =4> (iv): By associativity of <S>, we have that for every n and a G Sym(n), 
p(x n _i) <E> . . . ®p(x ) = p(av(„_i)) ® . . . <S>p(x a [p))- It follows that the Morley 
sequence of p is totally indiscernible. 

(iv) (i): Let </>(x; b) G L(U) and J |= pM\ A . Let J (= p (w) U/fe- Then I+J 
is a totally indiscernible sequence. Let e G {0, 1} be such that p h 0(x; 6) e . 
Then all points of J satisfy 0(x; 6) e . Therefore by Lemma f2.32[ at most a finite 
number of points of / satisfy _i 0(x; b) e . It follows that lim(7) h 4>{x\ b) e . □ 

Definition 2.34. An invariant type satisfying the equivalent conditions of 
Theorem 12.331 is called generically stable. 

Example 2.35. I. Let (M; <, A) be a model of the theory T dt of dense trees 
as defined in Appendix [A] Let c G M. We define the generic type of the closed 
cone centered in c as the global type p c (x) satisfying x > c and ->(x < b) for any 
b G U, b > c. This is a complete, c-invariant type. Its Morley sequence (b k : 
fc < w) is a sequence of pairwise incomparable elements, such that b k Abi = c for 
any k ^ I. Using quantifier elimination, we see that it is a totally indiscernible 
sequence. Therefore p c is generically stable. 
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2. Consider the language L = {R n (x,y) : n < u} and the L-structure M 
whose universe is Q and such that M \= R n (x,y) if and only if x < y and 
\x — y\ < n. This is sometimes called a "local order". 

Let p(x) be the global 0-invariant type satisfying ->i? n (a, x) A -*R n (x, a) for 
all aeW. Then this type is generically stable. 

Remark 2.36. If p is generically stable, then it is definable and finitely satisfiable 
over any one of its Morley sequences. (Because if 7 is a Morley sequence of 
p, then p = lim(J) is finitely satisfiable in /, furthermore it is / invariant, so 
it is definable over I). We can write a definition explicitly. Let 4>(x;y) be a 
formula, and let N = alt ((f)). If (a^ : k < u) is a Morley sequence of p, and 
b GW, then 



Notice in particular that the form of the formula giving the 0-definition depends 
only on 0. 

Proposition 2.37. Let p be generically stable and q any invariant type. Then 
p x ®q y = q y ®p x - 

Proof. Assume for a contradiction that for some formula <f)(x; y) G L{U) we 
have p x <E> % \~ <f)(x;y) and q y <E> p x l~ ^{.^y)- Let (a^ : k < u) \= p^\ 
b \= q\Ua <UJ and (a& : u < k < u2) |= p^\Ua <w b. Then for k < oj, -i^a^b) 
holds and for k > u, we have <j)(ak', b). As the sequence (a*, : k < u2) is totally 
indiscernible, this contradicts Lemma 12.321 □ 

Lemma 2.38. Let p be generically stable, then for any integer n, p^ n ' is also 
generically stable. 

Proof. One checks easily that a product of definable types is definable, and a 
product of finitely satisfiable types is finitely satisfiable. Therefore a product 
of generically stable types is again generically stable. □ 

Proposition 2.39. Let p be a generically stable, A-invariant type. Then p is 
the unique A-invariant extension of p\a- 

Proof. Notice that if q is any A-invariant type and r, s G S{U) are such that 
r\ A = s\ A , then (q <g> r)\ A = (q <g) s)\ A . 
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Let q be any A- invariant extension of p\ A . We will show by induction that 
9 U = P \a- It follows by Proposition 12.271 that p = q. The case n = 1 is 
the hypothesis on g. Assume that q^\ A = P^\a- We have: 

q {n+1) (x 1 ,...,x n+1 )\ A = (q(x n+1 )®q ( - n) (x 1 ,...,x n ))\ A 

= (<?On+l) ®p {n) (x U . . . ,X n ))\ A 

= (p {n) (x u . . . ,x n ) ® g(x n+ i))U 

= (n) 0l, • • • ,^n) ®p(x n+1 ))\ A 

= p { - n+1 \x l ,...,x n+1 )\ A . 

□ 

Historical Remarks 

The definition of NIP and most results in Section 12.11 are due to Shelah and 
appeared first in [78] (see also [51] [II .4]). Lemma [2.51 comes from Poizat's [70] . 
The characterization 12.311 also appears in that paper, but the proof we give 
here is from [80J. Eventual types are defined in [71], although the terminology 
was introduced by Adler in [1]. Proposition 12.141 was first proved by Shelah 
using the ded(A)-characterization and the fact that the statement is absolute, 
see the discussion in [72] [Chapter 1 2] . The proof we give here is very close to 
Poizat's approach in [72] [12.18], and was first published by Adler in [1]. 

The Baldwin-Saxl theorem is from [12]. See also [69]. Proposition 12.261 was 
proved by Shelah in [83] . 

Generically stable types are first defined by Shelah as 'stable types' in [52] . 
They are renamed and studied systematically by Hrushovski and Pillay in [37] 
and independently by Usvyatsov in [89]. This notion was extended outside of 
the NIP setting by Pillay and Tanovic in [68] . 



Chapter 3 



Honest definitions and 
applications 



A fundamental characterise property of stable theories is definability of types, 
namely the property that if A C U is any subset (big or small), and 0(x; b) G 
L(U), a formula, then the set <fr(A; b) coincides with the trace on A of some 
A-defmable set: there is ip(x; c) G L(A) such that <fr(A; b) = tfj(A;c). In other 
words, we can internalize the parameters of <p inside A (up to changing the 
formula). In this chapter, we show that a weak form of definability of types 
holds in NIP theories. We do not manage to find a definition of <f)(A; b) with 
parameters inside A, but we do with parameters in some elementary extension 
A' of A. Furthermore, the definition satisfies a property called honesty which 
says that, on A', the new formula lies inside the original one. 

Associated with definability of types are so-called reflexion principles stat- 
ing that if some (e.g. type-definable) set A is internally simple in some sense, 
then it is also externally simple: its interactions with the rest of the structure 
cannot be too complicated. We obtain some statements of this kind using hon- 
est definitions. Note that the characteristic property 12.51 of NIP is an example 
of such a phenomenon. It says that if a sequence / is indiscernible (internal 
simplicity), then the intersection of / with some unary W-definable set is a 
finite union of convex subsets of / (external simplicity). We will in fact extend 
this result to definable sets of higher arity (shrinking of indiscernibles). 

Throughout this chapter, we assume that T is NIP. 
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3.1 Honest definitions 

Pairs and definable types 

Let M be an L-structure and let A C M. We define the language L-p = 
L U {P(x)}, where P(x) is a new unary relation symbol. We will sometimes 
write x G P instead of P(x). 

The pair (M, A) is the Lp-structure whose L-reduct is M, and such that 
P(M) = A. 

Let A C M be any subset and let 6 G M fc and 4>(x; y) G L. We say that 
the 0-type of d over A is definable if there is a formula if)(x; z) and some tuple 
d £ A k such that b) = xjj(A; d). Note that this is expressible in the pair 
(M, A) by the Lp-formula Va; G P <p(x\ b) <H- i[)(x; d). 

In a stable theory, any type over any set is definable (see Appendix IB]) . 
This is not true in an unstable theory, however we will see now that a weaker 
property holds in NIP theories. 

Honest definitions 

Theorem 3.1. Let M \= T, A C M, <fi(x;y) G L a formula and b G a 
tuple. Then there is an elementary extension (M, A) -< (M',A'), a formula 
i/j(x; z) G L and a tuple d of elements of A' such that 

<j){A-b) C ^(A';d) C <f)(A';b). 

Remark 3.2. Note that the conclusion is equivalent to saying that there is a 
formula ip(x; z) such that for any finite A C <p(A;b), there is a tuple of 
elements of A such that A C ^(A; <i) C 6). 

Example 3.3. Take for example T to be DLO, M = (R, <) and A = Q. 
Let <f)(x;b) be x < ir and let ip(x;z) = x < z. We see that for any finite 
Aq C 0(y4;6), there is some point d E A such that Aq C ^(A; <i) C 0(A;6). 
Simply take (i to be a rational smaller than tt, but greater than all the elements 
of A . 

Proof of Theorem \3.1\ Let Sa C S x {U) be the set of global types (in the vari- 
able x), finitely satisfiable in A. It is a closed set of S X (U), and therefore 
compact. Let (M', A') y (M, A) be a |M| + -saturated elementary extension. 
Let p E Sa- We try to build a sequence (a^ : i < uj) such that for all i we have 
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- dj e A; 

- di \= p \ Aa Ki ; 

- -■(0(a i ; d) <H- 4>(a i+1 ; d)). 

If we succeed, then the sequence (a* : i < u>) is a Morley sequence of p over 
A and as such it is indiscernible. Then the third condition implies that has 
infinite alternation rank, contradicting NIP. We conclude that the construction 
must stop at some finite stage. So assume we have built (a* : i < n), and we 
cannot find a point a n . Let e p G {0,1} be such that p h 0(x;6) £p . We first 
argue that 0(a n _i;6) £p holds: the type p is finitely satisfiable in A, therefore 
for every subset B C M' of size < |M|, the Lp-type p\b{x) U {P(x)} is finitely 
satisfiable in A and thus realized in A'. Taking B to contain da <n we see that 
if |= -i0(a n _i; b) ep , we could find required. 

By compactness, there is a formula 9 p (x) G L(Aa <n ) such that p h 9 p (x) 
and (M',A') |= (Vx G P)6 , p (x) — >■ 4>(x;dY p . As is compact, we can find a 
finite set C Sa such that \J pe s 9 p (x) covers Sa- Set 

1>{x) = V 

€ p =l 

Write ip(x) = ip(x;d) where d G A' are the parameters appearing in ip. We 
show that xjj has the required properties. First, for all type p G Sa such that 
p h </>(x; 6), we have p h ip(x\ d'). In particular, this is true for realized types. 
It follows that (j)(A;b) C Furthermore, we have (M',A') \= (Wx G 

P)^(x; d) ->■ (j)(x; d), therefore tp(A', d) C 6). □ 

Definition 3.4. We say that the formula ip(x; d) in the previous theorem is 
an honest definition of (p(x; b) over A. 

The following corollary is simply the weakening of the theorem obtained 
by removing in the conclusion the "honesty" hypothesis ifj(A';d) C <f)(A';b). 
We state it separately because it might seem more natural and because it is 
sufficient for some applications. We will refer to it as weak stable- embeddedness 
of the set A. 

Corollary 3.5 (Weak stable-embeddedness). Let M \= T, A C M, (f)(x;y) G L 
and b G M^L Then there is an elementary extension (M,A) -< (M',A'), a 
formula ip(x; z) G L and a tuple d G A' such that 



( p(A-b) = ^(A ] d). 
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Corollary 3.6. Let M \= T and ACM. Let b £ M be a finite tuple. Let 
(M,A) -< (M',A') be an \M\ + -saturated extension. Then there is A Q C A' of 
size at most \T\ such that for any two tuples a and a' from A we have: 

a =Ao a ' == ^ a —b a' '• 

Proof. For any formula (f>(x; y), \y\ = \b\, let tjj^(x; d^) be an honest definition of 
<j)(x; b) over A where G A'. Take A to contain the union of the parameters 
dfj, for ranging over all formulas. □ 

Examples 

Let A C M be any subset and let A ind ^ be the structure in the language 
La,<d = {R<j,(x)(x) '■ ^ whose universe is A and where each R$(x) is 

interpreted the obvious way: for every a G A, A ind ^ \= R$(a) •<=>• M |= 0(a). 

Proposition 3.7. Let A C M. Assume that in Ai n d(<b), the quantifier-free 
formulas are stable. Then A is stably- embedded in M. 

Proof. By weak stable-embeddedness of A (Corollary 13. 5p . there is (M',A') y 
(M,A) and ip(x;d) G L(A') such that (f>(A;b) = ip(A;d). Note that we have 
^4md(0) -< ^i n d(%) an< ^ "0(^j ^) = R-tp(x;z) (A; d) . We work inside the structure 
A' ind ^y As the formula R^ x;z ) (x; z) is stable, the set R^ x;z ^(A; d) is defin- 
able by some .R^-formula (in particular quantifier-free) with parameters in A 
(see Appendix [B]) . This translates into some formula 8(x; c) G L(A) such that 
9(A; c) = 4>(A; d) as required. □ 

Exercise 3.8. Let D C M be a ^-definable set. Let <f>{x,y) be a ^-definable 
formula which defines a total order < on D. Assume that Di n d(0) JS o-minimal 
when equipped with that ordering. Then any subset of D definable in M is a 
finite union of < -convex sets. 

3.2 Naming a submodel 

Let M be a model of T. A subset D C M k of the form 0(M; 6) for some 
4>(x; b) G L{U) is called an externally definable set of M and the formula 
4>(x; b) is called an external definition of D. 
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Proposition 3.9. Let D C M k be an externally definable set. Then there is 
an external definition (j)(x;b) G L{U) of D with the following property: 

for every formula 6(x; a) G L(M), \x\ = k, such that D C 9(M; a), we have 
U |= <f>(x; b) -»■ 9(x; a). 

Proof. Let M -< N, with N an |M| + -saturated model. Let 4> (x;b ) G L(N) 
be an external definition of D. Theorem 13.11 applied to the pair (N, M) and 
the formula 4>o(x; bo) yields an elementary extension (N, M) -< (N', M') and a 
formula <p(x\ b) G L(M') such that D C 6) C o (x; 6 )- Let 0(x; a) G L(M) 
be a formula such that D C 0(M;a). Then (JV,M) (= (Vx G P)<po(x]b ) -> 
8(x;a). Therefore the same sentence holds in the pair (N',M r ). We conclude 
that 0(M'; b) C O (M'; 6 ) C 0(M'; a). As M' is a model, and 6 G M', we have 
M' |= (j)(x\ b) — > 6{x; a), as required. □ 

If M is a model of T we define the Shelah expansion of M, denoted M Sh 
as follows. The language of M Sh is Lsh{M) containing, for each integer k and 
each externally definable D C M fc , a /c-ary predicate Ro(x). The universe of 
M 5/l is M and the predicates are interpreted in the canonical way. Note that in 
particular M Sh is an expansion of M and contains a constant for every element 
of M. 

Proposition 3.10. The structure M Sh admits elimination of quantifiers. 

Proof. We have to show that the projection of an externally definable set is 
again externally definable. Let D C M kl+k2 be externally definable. Let 
<p(xi,X2',b) G L{U) be an external definition of D given by Proposition 13.91 
(where \x\\ = k\ and |rc 2 1 = k 2 ). Let n denote the projection M kl+k2 — > M kl 
and set ijj{x\; b) = (3x2)(j>(xi, x 2 \ b). We claim that 7r(D) = ip(M; b). 

It is clear that tt(D) C ip(M;b). To show the other inclusion, take some 
a G M kl \ n(D). Let C{x 1 ,x 2 ) = x 1 ^ a. Then D C C(M). By hypothesis on 
0(xi,x 2 ;&), we have 0(xi,x 2 ;fe) — >■ CC^i^ ^2)5 therefore ^(^li — > Xi 7^ a. It 
follows that a ^ ip(M; b) as required. □ 

Corollary 3.11. The structure M Sh is NIP. 

Proof. This follows from Proposition 13 . 101 since the formula Rd{x,vi) cannot 
have alternation rank greater than that of 4>(x,y) where <f>(x,y) G L{U) is an 
external definition of D. □ 

Remark 3.12. There are structures with IP which satisfy Proposition 13.101 In 
fact, there are structures M in which every subset of M k for all k is externally 
definable. This is the case for example if M a model of arithmetic. 
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Exercise 3.13. Show that there is a canonical bijection between types over 
in the structure M Sh and L-types over U, finitely satisfiable in M. 

Exercise 3.14. Let M \= T, (j>(x,y;d) G L(U). Assume that <fi(M;b) is the 
graph of a function. Then there is a formula ip(x, y; d) G L(U) such that 
4>{M] b) = ip(M; d) and U \= (Wx)(3- 1 y)ip(x, y; d) (so if)(x, y\ d) is the graph of 
a partial function). 

3.3 Shrinking of indiscernible sequences 

We are interested now in the case where A = I is an indiscernible sequence. 
We study the trace on I k of definable sets in k variables. The case k = 1 has 
already been dealt with in Chapter [2J the trace on / of a definable set is a 
finite union of </-convex sets. 

We will give several statements. Later results generalize previous ones, so 
there is some redundancy. 

Definition 3.15. Let (X, <) be a linear order. A convex equivalence relation 
~ on I is an equivalence relation all of whose classes are convex sets. The 
equivalence relation ~ is said to be finite if it has finitely many classes. 

If ~ is a convex equivalence relation on (X, <), we extend ~ to cartesian 
powers of X in the following way: if i = (ii, . . . , i n ),j = (ji, . . . ,j n ) G X n , then 
we set % ~ j if: 

- for all k <n, i k ~ j k ; 

- for all k, k! < n, we have i k < i k > j k < j k > and i k = i k > <^=^ j k = jy. 

If c G X, we define ~ c as the equivalence relation on X defined by: i ~ c j if 
either i = j = c, or (i < c and j < c), or (i > c and j > c). If c C X is a family 
(or a set) of elements, we define ~ g as the intersection of the relations ~ c for 
c ranging in c. 

Proposition 3.16. Let I = (c^ : i G X) be an indiscernible sequence. Let 
4>(xi, . . . , x n ; b) G L{U) be a formula. Then there is a finite convex equivalence 
relation ~ onX such that for all i G X n , j G J n , we have 

i~J^ <t>{tti\ b) 0(aj;6). 
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Proof. Without loss of generality, and for simplicity of notations, we assume 
that J is a sequence of singletons (since we can work in M eq for example). 

First assume that I is totally indiscernible. The structure 1^(0) is an 
expansion by definitions (naming constants) of I with pure equality. Therefore 
by Proposition 13 .7\ the set I is stably embedded. Let i/)(xi, . . . ,x n ;m), with 
m G I be such that ip(I;m) = 0(7; m). Without loss, ip(I;m) is a formula in 
the language of pure equality. Then the relation ~ m has the required property. 

Assume now that I is not totally indiscernible. Let M be some |/| + - 
saturated model containing I. There is some formula 6(x,y) G L(M) which 
orders I (namely such that M \= #(aj,Oj) -<=>- i < j). Consider the pair 
(M,I). By Corollary 13.51 there is an elementary extension (M, J) -< (M',I r ) 
and a formula ip(xi, . . . , x n ; d) G L(I') such that 0(7; b) = d). Note that 
the formula 8(x,y) defines a linear order on I' and thus ordered, I' is an in- 
discernible sequence. Write I' — (a^ : % G X') with X C X'. It follows that the 
formula i/j{xi, . . . ,x n ; d) is equivalent on I', to some boolean combinations of 
formulas of the form Xi = d^, 6(xi,dk), Xi = Xj, 8(xi,Xj) where each c4 is a 
element of the tuple d. We define ~ on X as being the restriction of the relation 
~d of X'. We see that ~ has the required property. □ 

Theorem 3.17. Let A be a small set of parameters and I — (aj : i G X) an 

indiscernible sequence. Let <j)(x, y\ b) G L{U) with x = (x±, . . . , x n ). Then there 
is a finite convex equivalence relation ~ on X such that for i, j G X™, we have 

i ~ j We G A, 0(a^, e; b) -H- 0(aj, e; b). 

Proof. The proof is similar to that of Proposition 13.161 except that instead of 
naming the sequence J, we name the product A x I (as a subset of M eq ). 

Take M a sufficiently saturated model containing A and / and consider the 
pair (M, Ax I). By Corollary 13 .51 there is an elementary extension (M, A x I) -< 
(M',A' x I') and a formula ip(x±, . . . ,x n ,y;a,m), a E A, m £ I' such that 
0(7 x A; 6) = x A;a,m). By elementarity, the set I' can be ordered as 
I' = (a, : z G X'), with X C X', such that it becomes a sequence indiscernible 
over A' (this is true also if 7 is totally indiscernible, in which case we can take 
any order on I'). One then checks that the relation ~ on X defined as the 
restriction of ~ m on X' has the required property. □ 

Remark 3.18. If the indiscernible sequence I is indexed by a complete order X, 
then the relation ~ in Theorem 13. II can be taken to be of the form ~ g for some 
c G I n . (Because any finite convex equivalence relation on X is of that form). 
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If furthermore there is a formula 6(x, y) which orders / and has parameters 
in /, then / is stably embedded. 

(This may fail even if I is not totally indiscernible. Take for example 
a circular order (M, R(x,y, z)), where R(x,y,z) is the betweenness relation 
and an indiscernible sequence I — (a^ : % G Q) of points from it. Then the 
involution H- a_fc is elementary, so we cannot define the order on I without 
extra parameters. This is the only possible obstruction; see |4T|). 

We say that a convex equivalence relation ~ is essentially of size k if it 
is the intersection of n many finite convex equivalence relations. Note that a 
relation essentially of size k has at most 2 K many classes. 

Remark 3.19. If X is a linear order, let compl(X) be the completion of X. Let 
~ be any convex equivalence relation on X essentially of size k. Then there is 
c C J of size < k such that the restriction of ~ 5 to X refines ~ and for every 
i, j G X \ c, we have i ~ j -<=>- i ~ E j. 

Therefore when given a convex equivalence relation on X which is finite 
(resp. essentially of size n), we may assume that it is of the form ~ g for some 
finite c C compl(X) (resp. c of size < hi). 

Corollary 3.20. Let A be a small set of parameters and I — (oj : i G X) an 

indiscernible sequence of finite tuples. Let b be a finite tuple. Then there 
is a convex equivalence relation ~ onX essentially of size \T\ such that for any 
4>(x,y; z) G L with x = (x\, . . . ,x n ) for any i,j G X n , we have 

i ~ j We G A, 0(cq, e; b) <r> 0(aj, e; b). 

Note that we recover the claim stated during the proof of Proposition 12.141 
that if the cofinality of X is greater than \T\, then there is an end segment of 
/ which is indiscernible over Ab. 

It turns out that this result is also true when the sequence / is composed 
of tuples of infinite length, although we need to redo the proof in that case. 

Theorem 3.21. Let A be a small set of parameters and I — (oj : i G X) an 

A-indiscernible sequence of tuples of arbitrary length. Let b G U be a finite 
tuple. Then there is a convex equivalence relation ~ on X essentially of size 
\T\ such that for any 0(x, y; z) G L with x = (x\, . . . , x n ) for any i,j G T n , we 
have 

i ~ j We G A, 0(a^, e; b) -H- 0(aj, e; b). 
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Proof. Let M be a sufficiently saturated model containing A, I and b. Write 
ai = (4 : j < a). We consider two new unary predicate P(x) and A(x) 
and three new binary predicates E(x,y), F(x,y) and H(x, y). Let L' = L U 
{P, A, E, F, R} and expand M into an //-structure (M, P, A, E, F, R) where 
P = {4 :j<a,iel}U A,E = {(4,4) ■ * £ IJ,f < a}, F = {(4,4) : 
G X, j < a} and R = {(4,4) :%<%'& I,j,j' < a}. 

Let (M, P, A, E, F, R) -< (M' , P', A', E', F', R') be some |M|+-saturated el- 
ementary extension. Using the extra structure given by E, F and R, we can 
write P' = A' U {4 : i < f3} for some (3 > a and X' D X and such that 

the sequence ((4)j<P '■ i ^ is indiscernible over A. By Corollary 13. 6[ there 
is some set P C P' of size at most \T\ such that for any two tuples a, a' G P, 
we have 

a =p a =^> a =b a' . 

Let c C X' be the set consisting of elements c G X' for which there is j < (3 
such that a J c G Aq. The equivalence relation ~ g on X' is a convex equivalence 
relation essentially of size \T\. Let ~ be its restriction to X. Then ~ has the 
required property. 

□ 

Exercise 3.22 (Critical points). Let I — (oj : i G X) be an A- indiscernible 
sequence of finite tuples. Assume that the order X is dense complete without 
end points. Let b G U be a finite tuple. Let <j)(xi, . . . , x n , y; z) be a formula with 
\xk\ = \ai\ and \z\ = \b\. Call an index i G X 0-critical if there is k G {1, ... , n}, 
some e G A and indices i\ < ■ ■ ■ < ik < ■ ■ ■ < i n in X such that for any 
< i < we can find < ik,i' k < such that: 

\= 4>(a h , . . . , a ik , . . . , a in , e; b) A ->^(a il , . . . , a^, . . . , a in , e; b). 

Let c denote the set of (p-critical points and define ~ 5 as usual. 

1. For any two hnite tuples X n and any e G A, we have 

(A) % ~ g j =J> (j)(aj, e; b) -B- ^(aj, e; b). 

2. For any convex equivalence relation ~ satisfying (A), if i,j G X are not 
critical points, then i ~ j i ~ 5 j '. 

3. There are finitely many (p-critical points. 

Note that in the previous exercise the relation ~ e is definable in the pair 
(M, I) and each 0-critical point is in dcl(Ab) read in that structure. 
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Historical Remarks 

Honest definitions were introduced by Chernikov and Simon in [23]. Corollary 
13.51 (weak stable-embeddedness) was proved previously by Guingona in [32] 
using a different method. Proposition 13.101 (Shelah's expansion) was proved 
first in the o-minimal case by Baisalov and Poizat in [TT] and in full generality 
by Shelah in [83]. The results on shrinking of indiscernibles were first obtained 
by Baldwin and Benedikt in [13], without the parameter set A, and generalized 
by Shelah in [83]. The proofs we give here are new. 



Chapter 4 

Strong dependence and dp-ranks 



We introduce the notion of dp-rank, which is similar to weight in stable theories. 
It is a measure of the complexity of a type in an NIP theory. A theory is 
strongly- dependent if all types have dp-rank < K . In such theories, shrinking 
of indiscernibles can be done at once for all formulas, using only finitely many 
cuts. 

We assume NIP throughout this chapter. 

4.1 Mutually indiscernible sequences 

Definition 4.1. Let (It : t G X) be a family of sequences and A a set of 
parameters. We say that the sequences {/ t : i G X} are mutually indiscernible 
over A if for each t G X, the sequence It is indiscernible over A U {// : I G 
X\{t}}. 

Example 4.2. Let / = (a, : i 6 I) be an A-indiscernible sequence. Let 
(X t : t G X) be a family of pairwise disjoint convex subsets of X. For t G X, let 
I t — (aj : i G Xj). Then the sequences {/ t : t G X} are mutually indiscernible 
over AU {a>i : i el \ \J teX l t }- 

Example 4.3. Let (pt : t < a) be a family of global A-invariant types. Build 
a family (I t : t < a) of sequences such that for each t G X, is a Morley 
sequence of p t over A U {/; : I < i}. Then the sequences {I t : t G X} are 
mutually indiscernible over A. 

[ Proof: Observe that if / is indiscernible over A, p is a global A-invariant 
type and a |= p \ AI, then / is indiscernible over Aa. In particular this holds 
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if a realizes a Morley sequence of p over AI. It is immediate by invariance of p. 
More generally, if the sequences {It : t < f3} are mutually indiscernible over A, 
then they remain so over Aa where a \= p \ A U {It : t < f3}. Thus the result 
follows by induction. ] 

Example 4.4. Let (p t : t < a) be a family of global A-invariant types such 
that pt(x) (g) Psiy) = Psiy) ® Pt(%) for any t, s < a, t ^ s. Build an array 
(a* : t < a, k < oj) such that a\ \= p t \ A U {af : (I < k) or (I = k and s < t)}. 
For t < a, define I t = {a\ : k < u) . Note that it is a Morley sequence of p t 
over A. 

Then the sequences {It : t G X} are mutually indiscernible over A. 
[ Proof: We show that for each t G X, J t is a Morley sequence of p t over 
{/s :s£l \ {£}}• Let N < u and pick pairs 

(fc , t ) > ■ • • > (fcn-l, *n-l) eu X a, 

where wxais ordered lexicographically. Let c < n be minimal such that t c = t 
(we assume such a c exists). We have 

4V ' ■ ' ^fc 1 ! 1= P*o fao) ® • • • ® Pt n -t (Xn-l) I A- 

As the type commutes with p tm for m < c, we have 

4V ' ' ' ^fc 1 ! h Pfc Oc) ® fto Oo) ® ■ ■ ■ ® ft c O c ) <g> ■ ■ ■ ® p tn _ 1 (a; n _i) U- 

Where Pt c (x c ) means that this term is omitted. Therefore a fe c |= pt c f A U 
{a^, m : m 7^ c}. It follows that the sequence I t is a Morley sequence of p 4 over 
AU {I s : s 7^ t} and we conclude as in the previous example. ] 

Exercise 4.5. Let {It : t G X} be mutually indiscernible sequences. For 
t G X, let p t = \im(I t ). Then for t,s G X, t ^ s, the types p t and p s commute 
(i.e., p t {x) ®p s (y) = p s {y) ®p t {x)). 

Remark 4.6. Let (I t : t G X) be a family of sequences, with I t = (a\ : i G X t ). 
Assume that the sequences (It : i G X) are mutually indiscernible over A. Let 
J be a linearly ordered set, and for t G X, let cr^ : ,7 — > X be an increasing 
embedding. Then the sequence ((4 t (i))* eX : i G is indiscernible over A. 

Proposition 4.7. Let (I t : t G X) 6e a family of sequences mutually indis- 
cernible over A. Let b £U be a finite tuple. Then there is a set Xj, C X of size 
at most \T\ such that the sequences {It ". t G X\X&} are mutually indiscernible 
over Ab. 
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Proof. For simplicity assume that the sequences I t are sequences of finite tuples. 
(The case of infinite tuples can be taken care of as in the proof of Theorem 
13.21 p . By working in M eq , we may assume that they are sequences of singletons. 
Write I t = (a\ : i G It). Let P(x) and A(x) be two new unary predicates, 
and Hj(x,y), H(x,y) be new binary predicates. Let V = L U {P, A, E, R}. 
We define an //-expansion (M, P, A, E, R) of M by interpreting P(x) as the 
set P = {a\ : t G X,i G X t } U A, A(x) as the set A, R(x,y) as the set 
R= {{al a\,) :teX,i<i' G X t }. 

Let (M,P,A,R) -< (Af , P', A', R') be some |M|+-saturated elementary ex- 
tension. Using the extra predicates added to the language, we see that there 
is some I'DI and sets X t for t G X', with X t C X[ when t G X such that P' 
can be written as P' = A' U {a* : t G X',z G X^} and such that the sequences 
{I' t = (a* : z G Xt) : t G X'} are mutually indiscernible over A'. 

By Corollary 13.61 there is some Po C P' of size at most \T\ such that for 
any finite tuples a, a' G P we have a =p a' =^ a a'. Let Xj C X be the set 
of elements t G X such that some a\ belongs to Po. Then Xo has size at most 
\T\ and we see that the sequences {It ". t G X \ Xb} are mutually indiscernible 
over Ab. 

□ 

Exercise 4.8. Let (pt : t < \T\ + ) be a family of global A-invariant types such 
that Pi(x) ®Pj{y) = Pj(y) ®Pi{x) for all i ^ j. Let q be any global invariant 
type. Then there is t < \T\ + such that q(x) ®Pt{y) = Pt(y) ® q{x). 

[Hint: Build first sequences (It : t < \T\ + ), each one being a Morley se- 
quence of pt over the previous ones. Then take b realizing q over those se- 
quences. Finally realize again a family (J t : t < \T\ + ) of Morley sequences 
of the p t 's over what has been constructed so far. Notice that the sequences 
(I t + Jt ■ t < \T\ + ) are mutually indiscernible and apply Proposition 14. 71 ] 

Exercise 4.9. Let (I t :t£ X) be a family of mutually indiscernible sequences, 
indexed by the same linear order X. Write It = (a* : i G X). Define I = ((a\ : 
t G X) : i G X). If A is any set such that I is indiscernible over A, then the 
sequences {I t : t G X} are mutually indiscernible over A. 

Exercise 4.10. Let (It : t < a) be a family of mutually indiscernible endless 
sequences. For t < a, let p t = hm(J t ). Let J be any linear order. Construct 
sequences (J t : t < a), J t = (a\ : i G J) such that for each t < a and i G J ', 
a \ H Pt \ U{Is : s < a} U {a s j : s < t or (s — t k, i < j)}. Then the sequences 
{I t + Jt ■ t < a} are mutually indiscernible. 
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[Hint : Let t < a and set A t = \J{I t : t < a] U \J{J S : s <t}. Show that 
It + Jt is indiscernible over A t . Also, as Jt realizes some I t -invariant type over 
A t , any sequence in A t indiscernible over I t remains indiscernible over I t + J t . 
The result follows by induction on t < a.] 

4.2 Dp-ranks 

Definition 4.11. Let p be a partial type over a set A, and let k be a (finite 
of infinite) cardinal. We say dp-rk(p, A) < k if for every family (J t : t < k) of 
mutually indiscernible sequences over A and b \= p, there is t < n such that I t 
is indiscernible over Ab. 

If a ElA, then dp-rk(a/A) stands for dp-rk(tp(a/A), A). 

We implicitly allow the sequences It to be sequences of infinite tuples. 
However, if we impose that they be sequences of finite tuples, then we ob- 
tain an equivalent definition. This is because if an indiscernible sequence 
/ = (aj : i G X) is not indiscernible over some b, then there are finite a! i C a i 
such that the sequence (aj : i 6 X) is indiscernible, but not indiscernible over 
b. 

Remark 4.12. For p a type of a finite tuple, by Proposition \A.7\ we always have 
dp-rk(p, A) < \T\+. 

We say that dp-rk(a/A) = k if dp-rk(a/A) < k + , but not dp-rk(a/^4) < k. 
Note that it may happen that we have dp-rk(a/A) = k for no value of k. 
Consider for example the following situation: for every integer n, we can find 
a family (I t : t < n) of mutually indiscernible sequences over A, none of which 
is indiscernible over Aa, but we cannot find such a family of size Ko- We would 
then have dp-rk(jo, A) < K , but dp-rk(p, A) > n for all n. One could probably 
write "dp-rk(a/y4) = Kq" (as done for example in [2]), but we will not use that 
notation. 

Lemma 4.13. Let p be a partial type over A and k any cardinal. Let also 
AC B. Then dp-rk(p, A) < n <^=^ dp-rk(p, B) < n. 

Proof. Assume that dp-rk(p, A) < n. Let (I t :t< k) be mutually indiscernible 
over B sequences and let b |= p. For t < n, write I t — (a\ : i e T t ) and define 
the sequence J t = (a\"B : i 6 X t ). Then clearly, the sequences (J t : t < k) are 
mutually indiscernible over A. By hypothesis, there is t < k such that J t is 
indiscernible over Ab. This implies that I t is indiscernible over Bb. 
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Conversely, assume that we have a witness of dp-rk(p, A) > k. Namely, we 
have some b \= p and a family ( J t : t < k) of sequences, mutually indiscernible 
over A such that no It is indiscernible over Ab. Composing by an automor- 
phism, we may assume that the sequences are mutually indiscernible over B. 
Then we obtain a witness of dp-rk(p, B) > k. □ 

Remark 4.14. If p C q are partial types over B, then clearly dp-rk(p, B) < k 
implies dp-rk(g, B) < k. It follows that if A C B and aGW, then dp-rk(a/A) < 
k implies dp-rk(a/5) < k. 

Proposition 4.15. Let p be a partial type over A and let k be any cardinal. 
Then we have dp-rk(p, A) < k if and only if for any family (I t : t G A) of 
sequences, mutually indiscernible over A and any b |= p, there is A C X of 
size < k such that {I t : t G X \ A } are mutually indiscernible over Ab. 

Proof. It is obvious that the property considered implies dp-rk(p, A) < n. We 
show the converse. 

Case 1 : n is infinite. 

Assume that dp-rk(p, A) < n and assume that (I t : t G X) and b give a 
counter-example to what we have to prove. Without loss, X = k. We can build 
an increasing sequence (5 t : t < k) of ordinals, and a sequence (A t : t < k) of 
finite subsets of k such that: 

- for all t < K, the sequence I$ t is not indiscernible over {&}U|J{/ S : s G A t }; 

- for all t<f<K, (A t U {5 t }) n (A t / U {S t >}) = 0. 

Let B = A U : s e A t ,t < k}. Then the sequences {/^ : t < k} are 

mutually indiscernible over B, and for each t < k, It is not indiscernible over 
Bb. This contradicts dp-rk(p, B) = dp-rk(p, A) < k. 

Case 2 : k = n + 1 is finite. 

Assume that dp-rk(p, A) < n+1 and let (I t : t G A) be sequences, mutually 
indiscernible over A. We may assume that A is finite and we show the result 
by induction on |A|. 

If |A| < n, the result is obvious as we may take A = A. 

Assume that |A| = n + k + 1, and we have shown the result for sets of 
cardinality < n + k. Let (It : t < n+k + 1) be mutually indiscernible over A and 
let b \= p. We may assume that the sequences I t are endless. For t < n + k + 1, 
let p t denote the limit type lim(/t). Construct sequences I[ = (c\ : k < to), 
such that 4 \= p t \ AbU {I t : t < n + k} U {I' s : s <t} U {cj : I > k} (so those 
are Morley sequences of each p t , but read backwards). 
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Then the sequences {I t + I' t : t < n + k + 1} are mutually indiscernible over 
A (see Exercise 14. 10p . 

Let B = A U \J{I' t : t < n + k + 1}. 

As dp-rk(p, £?) = dp-rk(p, A) < n, there is some t < n + k + 1 such that J t 
is indiscernible over Bb. Without loss, assume that t = 0. 

By induction hypothesis, working over the base set AI , there is some X Q C 
{1, . . . ,n + /c} of size at most n such that the sequences {I t +I' t : t £ {1, . . . ,n + 
/c}\Aq} are mutually indiscernible over A/q^- Again, without loss, assume that 
the sequences {I t +I' t : < t < k + 1} are mutually indiscernible over AI b. If I 
is indiscernible over AU|J{/ t : < t < k + 1} then the sequences {I t : t < & + 1} 
are mutually indiscernible over Ab and we are done. 

Otherwise, there is some formula <f>(x, d) £ L(Ab U [J{It '■ < t < k + 1}) 
which witnesses that Jo is n °t indiscernible over U [J{It :0<t<A; + l}. 
As the sequences {I t + 1[ : < i < + 1} are mutually indiscernible over AI Q b, 
we can move the parameters in belonging to some Ik to parameters in I' k . 
Formally, there is some d' =Ai b d such that d! £ AI U U{-^t '■ < t' < k + 1}. 
Then 0(cc, d') witnesses the fact that Iq is not indiscernible over Bb. This 
contradicts the assumption on Iq. □ 

We recall that the notation compl(X) is used to denote the completion of 
the linear order X. Recall also the definition of the equivalence relation ~ s as 
defined before Proposition 13.161 

Theorem 4.16. Let p be a partial type over A, and k a cardinal (possibly 
finite). The following are equivalent: 

(i) dp-rk(j>, A) < k; 

(ii) o If (It : t £ X) are mutually indiscernible over A and b \= p, then there 
is some X$ C X of size < k such that for t £ X \ Xq, all the members of It 
have the same type over Ab; 

(ii)i Same as above, but change the conclusion to: for t £ X \ X , the 
sequence I t is indiscernible over Ab; 

(ii) i Same as above, but change the conclusion to: the sequences {It : t £ 
X \ X } are mutually indiscernible over Ab; 

(iii) o If I = (cii : i £ I) is a sequence (of possibly infinite tuples), there is 
some c £ compl(X), |c| < k such that i ~ g j =>- tp(ai/Ab) = tp(aj/Ab); 

(iii)i Same as above, with the conclusion that each ~ 5 -class is indiscernible 
over Ab; 
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(111)2 Same as above, with the conclusion that the ~ 5 -classes which are 
infinite are mutually indiscernible over Ab. 

Proof, (i) =>■ (ii) 2 • This is exactly Proposition 14.151 
(ii) 2 (ii) 1 (ii)o: Clear. 

(ii) o (i): Assume (ii)o and assume for a contradiction that there is some 
family (J f : t < k) of sequences, mutually indiscernible over A and b \= p 
such that for each t, It is not indiscernible over Ab. Without loss, each I t is 
indexed by Q: I t — {a\ : i G Q). Then for each t, there are a t < f3 t G Q such 
that tp (a a JAC t a) 7^ tp(a^JAC t a) where C t = {a\ : i < a t or i > (3 t }. Set 
I[ = (atfCt : a t < i < f3 t ). Then the family (I' t : t < k) contradicts (ii)o- 

(iii) 2 =>- (iii)i =>- (iii)o: Clear. 

(ii) 2 =>• (111)2: Assume (ii) 2 and let / = (a^ : i G X) be indiscernible over 
A, and b \= p. Without loss, X is a complete order, so compl(X) = X. We 
may extend / to some sequence J = (a^ : % G J) where X C J7" for J some 
sufficiently large complete order and J is indiscernible over A. As X is complete, 
to every point x G J which lies in the convex hull of X, there is a unique point 
7r(x) G X such that 

(Vz G X)(z > 7r(x) =>- % > j) A (i < 7r(x) =>- z < j). 

As ,7 is a complete order, for every formula <f>(xi, . . . , a; n ; y), with = |aj| 
and I y\ = \b\, there is some finite G J m such that for any i, j G ,7™, we have 
* ~<L, J ^ 0( a «i ^ b). Let d be the union of (the ranges of) all tuples 
g?0 for ranging over all formulas as above. 

Increasing J if necessary, we can find open disjoint intervals (Ji : i G XUd) 
such that i G Ji for all i G X, and d (z Jd for all <i G d. 

Let be the (convex) equivalence relation on ,7 defined by: jEj' if and 
only if, for all i G d, either (j G Ji and j' G ,7) or (j < ,7 and j 1 < Ji) or 
(j > Ji and j' > Ji). Let .ft be the family of all E'-classes. Then the sequences 
{(dj : i G X) : A G .ft)} are mutually indiscernible over Ab. Furthermore, by 
construction of E, for each which intersects the convex hull of X, there 

is some cx G X such that Cy = 7r(x) for all x G A. 

By (ii) 2 , there is some $0 C .ft of size < k such that the sequences in 
{(cj : i G A) : A G .ft \ .fto} are mutually indiscernible over Let .ft* C .ft 
denote the set of sequences which intersect the convex hull of X. Finally, let 
c = (cx ■ A G £*). Then |c| < «. 

We show that ~ g has the required property. Let <p(xi, . . . , x n \ y) G 7(A) be 
a formula. Let ii < ■ • • < i n G X \ c and let k G {1, . . . , n}. It is enough to 
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show that |= 4>{ a iii ■ ■ ■ i a i n j b) -H- 0(0^, . . . , , . . . , a» n ; 6) for any i' fc £ X such 
that ik-i < i' k < ik+i and i' k ~ g ik- 

Assume this does not hold, as witnessed by some i' k > ik- Let 

U = inf{j £ J : j > i k & \= ^4>{a h , a in ; b) <H> 4>(a h , ...,aj,..., a in ; 6)}. 

Then i* belongs to the tuple d^. In particular, z* lands in the interior of some 
member of R. As ik < i* < i' k and i k ~g i'k, i* is in some sequence X of M.\ 8$. 
Therefore the sequence (a; : i £ X) is indiscernible over A6U{aj fc : k < n,a ik ^ 
A}. This contradicts the definition of i*. 

(iii)o =>■ (ii)o : Let (I t : t < a) be mutually indiscernible over A, and let 
6 |= p. Write 7* = (a* : i £ X t ), where the indexing orders X t are disjoint. Let 
X denote the linear order © t<Q X t . We may expand the family (I t : t < a) 
into a family (J t : t < a) of sequences, mutually indiscernible over A such that 
J t — (a\ : i £ X) (in particular, 7 4 is a subsequence of J t ). Let J = {a^a\" ■ ■ ■ : 
z £ X). Then J is an indiscernible sequence. By assumption (iii)o, there is 
some c C compl(J) of size < k such that 

i ~ c - j =^ tp(a°X lA • • • /A6) = tp(a?*a^ " ' ' l Ah )- 

Let Xq C a be the set of t < a such that Ao D It ^ 0. Then |Ao| < k and 
A satisfies the requirement of (ii)o- □ 

Example 4.17. In (iii) 2 , we insist that the infinite classes are mutually indis- 
cernible. If the indiscernible sequence I is densely ordered, then all classes are 
either infinite or reducted to a point. So the conclusion says nothing about 
the latter ones. We could wonder whether we can ask for the infinite classes 
to be mutually indiscernible over (A and) the finite ones. This turns out to be 
an even stronger property, which we could call dp-rk + (p, A) < k (because it is 
linked with what Shelah calls strongly + -dependent). We will not go into this, 
but only give an example where the two notions differ. 

Let T =ACVF, the theory of algebraically closed valued fields (see Section 
IA.2[) . Let (aj : i £ Q) be an indiscernible sequence of elements such that v(aA < 
v(a,j) for all % < j. Then we can find a point b such that v(b — (ao + • • • + a n )) > 
v(a n ) for all n < uj (we then necessarily have v(b — (ao + • • • + a n )) = v(a n+ i). 
In other words, 6 is a pseudo-limit of the sequence (ao + • • • + a n : n < cu)). 

Then the point b splits the sequence I into three pieces: (a; : % < 0), (a ) 
and (aj : i > 0). The two infinite ones are mutually indiscernible over 6. 
However, the sequence (aj : i > 0) is not indiscernible over ao&. In fact that 
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tuple breaks it again into three pieces: (aj : < i < 1), (ai) and (a.; : i > 1). 
And we can go on. 

(In Shelah's terminology, this shows that ACVF is not strongly + -dependent. 
However it is strongly dependent, even dp-minimal, as defined below.) 

Proposition 4.18. Let a,b and A a small set of parameters and k±, k 2 two 

cardinals such that dp-ik(a/Ab) < Ki and dp-ik(b/A) < n 2 , then dp-rk(a, b/A) < 

«1 + «2 — 1- 

Proof. Here k — 1 is equal to k when k is infinite. 

We use condition (ii) 2 - Let (I t : t G X) be mutually indiscernible over A. 
There is some X C X of size < «i such that the sequences I t , t £ X \ X are 
indiscernible over Ab. Then there is some X\ C X \ X of size < k 2 such that 
the sequences I t , t G X \ (X U Xi) are indiscernible over Aab. □ 

Note in particular that the hypothesis of the proposition are satisfied if 
dp-rk(a/A) < Hi and dp-rk(&A4) < k 2 - 

Another characterization of dp-ranks is by Shelah-style arrays. 

Definition 4.19. Let p(y) be a partial type over a set A. We define n ict (p, A) 
as the minimal k such that the following does not exist: 

- formulas (fi a (x a ; y); 

- an array (a" : i < u, a < k) of tuples, with \af\ = \x a \; 

- for every 77 : k, — > uj, a tuple b v such that we have 



(Such a family of tuples and formulas will be called an ict-pattern for p. 
The cardinal k is the length of the pattern). 

Proposition 4.20. For any partial type p over A and cardinal n, we have 
dp-rk(p, A) < k if and only if Ki C t(p, A) < k. 

Proof. Assume there is an ict-pattern for p of length k. Then by expanding 
and extracting, we may assume that the rows (a" : i < u), a < n, are mutually 
indiscernible over A. Then for any 77, b v witnesses that dp-rk(p, A) > n. 

Conversely, assume that dp-rk(p, A) > k. By condition (ii)o of Theorem 
14.161 we can find: 

- sequences (I t : t < k), mutually indiscernible over A, without loss I t = 
{a\: i < oj) are sequences of finite tuples; 
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- formulas (f>t(xt] y)] 

- a tuple b |= p such that |= 0t(ct o ; fy A -*j) t (a\] b). 

Let tp t (x u x' t ;y) = <fi t (x t ;y) A ^(f) t (x' t ; y) . For t < «, let J t = (a^V 2i+1 : 
% < u). By NIP, at most finitely many tuples a a! from J t satisfy ip t (x t ,x' t ; b). 
Removing some if necessary, we may assume that a t Q "a\ is the only element of 
J t for which that formula holds. By mutual indiscernability of the sequences 
{Jt '■ t < k}, for every rj : k — > u, we can find some b v such that, for all t < k, 



Note that the difference between dp-rk and Ki ct is only due to the convention 
chosen for the definition of dp-rank. One argument in favor of this definition, 
as opposed to that of Ki Ct , is that 1- types in dp- minimal theories are of rank 1 
and not 2. 

4.3 Strongly dependent theories 

Definition 4.21. The NIP theory T is strongly dependent if for any finite tuple 
of variables x, we have dp-rk(x = x, 0) < No- 
Example 4.22. Let £ = {Ei : i < u} where the E^s are binary predicates. 
Let T be the £-theory stating that each Ei defines an equivalence relation 
with infinitely many classes, each of which is infinite. Consider T\ D T stating 
that (yxy)xEi + iy — > xEiy, and each i?j-class is split into infinitely many E{+i 
classes. 

Consider also T 2 DT stating that the E^s are cross-cutting, that is: given 
do, . . . , a n -i, there is a such that aE^a^ for every k < n. 

Then T\ is strongly dependent, but not T 2 (and note that both are stable). 

Remark 4.23. If T is a superstable theory, then it is strongly dependent, but 
the converse does not hold as witnessed by T\ in the previous example. 

Proposition 4.24. If for every variable x with \x\ = 1, we have dp-rk(x = 
x, 0) < K 0; then T is strongly dependent. 

Proof. This follows immediately from Proposition 14.181 □ 




This shows that Ki Ct (p,A) > k. 



□ 



An extreme case of strongly dependent theories are dp-minimal theories. 
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Definition 4.25. The theory T is dp-minimal, if dp-rk(x = x, 0) = 1, for x a 
singleton. 

Example 4.26. The following theories are dp- minimal (see Appendix |A] or 
[28]): 

- any o- minimal theory; 

- any C-minimal theory (for example, ACVF, the theory of algebraically 
closed valued fields; 

- the p-adics; 

-the theory of (Z,0, l,+,<). 

We point out some link with honest definitions. First an easy lemma. 

Lemma 4.27. Let A (Z 14. Let (p t : i < u) be a family of A-invariant types 
and let (cj, di : % < uj) and b be finite tuples such that: 

~ Q (= Pi I" ^4c<jd<j/ 

- di |= pt I" Ac^d^; 
-tp(ci/i46) ^tp(di/Ab). 

Then T is not strongly dependent. 

Proof. Build sequences Ii = (e* : j < u) for i < uj such that Ii is a Morley 
sequence of pi over AI Ki . Then we see that tp((cj, di : i < oj)/A) = tp((eQ, e\ : 
i < uj)/A). Therefore, composing by an automorphism, we may assume that 
Cq = Ci and e\ = di for alH < uj. Then the sequences {Ii : i < u} are mutually 
indiscernible over A, but none of them remains indiscernible over Ab. This 
implies that T is not strongly dependent. □ 

Proposition 4.28. Let T be strongly dependent. LetM\=T,A<ZM,be M 
a finite tuple, and (M,A) -< (M',A') some \M\ + -saturated extension. Then 
there is Aq C A' finite such that every formula 4>(x; b) G L(b) has an honest 
definition over A with parameters in AAq. 

The conclusion means that we can find i/)(x] c) G L(AA ) such that <f>(A; b) C 
^{A'-c)<Z<j>{A'-b). 

Proof. For x a finite tuple of variables, let C S X {U) be the set of global 
types, finitely satisfiable in A. By Lemma 14.271 we can build a maximal se- 
quence (ci,di : i < N) such that there are types p, G S£. with: 

_ c i l = Pi \ ^- c <id<ii', 

- di |= pi r AcKid^; 
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-tp(ci/A6) ^tv(di/Ab). 
Set A Q = {a,di : i < N}. 

Let now 0(x; b) be given. Then for any type q G S£, there is a truth value 
e q G {0, 1} such that for any realization a of q\AA m A', we have |= 0(a; 6) e<! . 
By saturation of the pair (M', A'), there is a formula ip q (x) G g|x4 suc h that 
(M',A') \= (Vz G P)^(x) ->■ 

We then conclude exactly as in the proof of Theorem 13.11 (Namely, we 
extract from {ip q (x) : e q = 1} a finite subcover, and take the union of those 
formulas.) □ 

Note that the converse does not hold, since any stable theory satisfies the 
conclusion of the proposition, but not all stable theories are strongly dependent. 

We end this section with a example of an application to definable groups. 
We say that a group G is of finite exponent if there exists n < u such that 
g n = e for all g G G. 

Proposition 4.29. Let G be a dp-minimal group. Then there is a definable, 
abelian, normal subgroup H such that the quotient G/H is of finite exponent. 

Proof. Claim: For any two definable subgroups H and K of G, one of [H : 
H n K] or [K : H n K\ is finite. 

Proof: Assume not. Then we can find two sequences (a* : i < u) of points 
of K and (&j : i < u) of points of H such that the cosets {a^if : i < u} are 
distinct, as well as the cosets {Kbi : i < u}. Note that for any i,j < oj, there is 
a point in aiHHKbj, namely a^bj. And as two distinct cosets of the same group 
are disjoint, we have aibj G ai>H fl Kby <^=^ (z, j) = (i',f)- By Proposition 
14.201 this shows that dp-rk(ajfej, 0) > 2, contradicting dp-minimality. 

It follows that if H and K are two definable subgroups, a G H and b G K, 
then there is n such that either a n G K or b n G H . If a G G, we let C(a) = 
{g G G : ga = ag}. Then applying this to the uniform family {C(a) : a G G} of 
definable groups yields: there is some integer n such that for any two elements 
a,b G G, there is k < n such that either a fc G C(&) or 6 fc G C(a). In particular, 
if N = n\, then a N and fe^ commute. 

Let H = C(C(G N )), then H has the required properties. □ 

Historical remarks 

Strong dependence was defined by Shelah in [83] as a tentative analog of su- 
perstable, and further studied in [77]. In that latter paper, he defines a whole 



4.3. STRONGLY DEPENDENT THEORIES 



51 



family of "dp-ranks" which depend not only on a type p and a set J, but also 
of a set B of indiscernible sequences, or pairs of points, "split" by p. This 
allows to make constructions where the rank drops. He uses them to prove 
that in a strongly dependent theory, any long enough sequence contains an 
infinite indiscernible subsequence. The notion of dp-rank we present here is a 
simplification of those ranks, it is also similar to Shelah's n ict , but restricted 
to a type. It was first explicitly defined by Usvyatsov in [89J and it coincides 
in NIP theories with what Adler had previously called burden in [2]. 

Proposition 14.151 and the additivity of dp-rank that follows from it is from 
[4"9] . (In which the convention for dp-rk(p) is slightly different. To be precise, 
dp-rk(p) < k for us is equivalent to dp-rk(p) < k — 1 in the sense of [30] • In 
particular, for infinite cardinals, that latter definition agrees with Kj Ct , but not 
for finite ones.) The equivalent characterizations given in Theorem 14.161 are 
from [83], except for (ii) 2 and (iii) 2 which make use of Proposition 14.151 

The notion of a dp-minimal theory originates in Shelah's work, but was 
precisely defined and studied by Onshuus and Usvyatsov in [6~4"j . 

Proposition 14.281 is new. Proposition 14.291 is from |88j . 
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Chapter 5 
Forking 



Given two sets A C B and a type p G S'(A), we want to define a notion of a 
free extension of p to a type over B. When A = M is a model, the work done 
in Chapter [2] provides us with a natural option: an extension of p is free if 
it extends to a global M-invariant type. However, it is not immediately clear 
how to generalize this to arbitrary base sets. 

The idea of forking is to give such a definition (or rather a definition of the 
opposite: the free extensions are the non-forking ones). This definition makes 
sense in any theory, and satisfies different kinds of properties depending on the 
characteristics of the theory. For example, in simple theories, non-forking is an 
independence relation which means in particular that it satisfies symmetry and 
transitivity (see for example [IE])- In NIP theories, this is not true, but other 
properties hold. In particular a type has only boundedly many non-forking 
extensions, and over models, non-forking coincides with invariance. As we will 
see, over an arbitrary set A, it is equivalent to bdd(A) -invariance. 

Even though the general definition of forking coincides, for NIP theories, 
with an apparently simple one, experience shows that it is useful to come back 
to it from time to time. 

5.1 Bounded equivalence relations 

In the first three sections of this chapter, we do not assume NIP. 

An equivalence relation E between tuples of U is A-invariant if whenever 
a =a a! and b =a b', then aEb •<=>- a'Eb' . The relation xEy is type-definable 
over A if it is defined by a partial type tt(x; y) over A. 
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Proposition 5.1. (T any theory) 

Let A C U and let E be an A-invariant equivalence relation on hi. Then 
the following are equivalent: 

(i) the setU/E is bounded (i.e., of size < R); 

(ii) \U/E\ < 2l A l + l T l; 

(Hi) for any A-indiscernible sequence (aj : i < u) and i,j < u, we have 
OiEcLj ; 

(iv) for any model M D A and a =m b, we have aEb. 

Proof, (ii) (i): Clear, 
(iv) =>- (ii): Clear. 

(iii) =^ (iv): Let a =m b and take p to be a coheir of tp (a/M). In particular, 
p is M-invariant. Let (aj : % < u>) be a Morley sequence of p over Mab. Then 
both (a) + (a« : i < u) and (b) + (a^ : i < u) are indiscernible sequences. 
Therefore by (iii), we have aEa$Eb. 

(i) =3- (iii): Assume there is some A-indiscernible sequence (aj : i < u) such 
that -i(aiEaj) for i ^ j. Then we may increase the sequence to one of size R. 
This contradicts (i). □ 

An A-invariant equivalence relation E is bounded if it satisfies one of the 
equivalent conditions above. 

Definition 5.2. We say that two tuples a and b have the same Lascar strong- 
type over A if we have aEb for any A-invariant bounded equivalence relation. 
We write Lstp(a/A) = Lstp(6/A). 

Lemma 5.3. The relation "having same Lascar strong-type over A" is the 
finest bounded A-invariant equivalence relation. 

It is the transitive closure of the relation Ra{o-] b) defined to hold if there is 
a model M D A such that a =m b. 

It is also the transitive closure of the relation ^(a;^) defined to hold if 
(a, b) is the beginning of some A-indiscernible sequence. 

Proof. This is clear from the previous proposition. □ 

We let Autf{U/A) be the set of automorphisms of U fixing Lascar strong- 
types over A. In other words, it is the set of a G Aut{U/A) such that 
Lstp(a/A) = Lstp(<r(a)/A) for all a. It is also the subgroup of Aut(U) gene- 
rated by the groups Aut{U/M) for M a model containing A. 
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We say that a global type p is Lstp^-invariant if it is invariant under 
Autf{U/A). This is equivalent to saying that if (a» : i < uS) is an A-indiscernible 
sequence in 1A and a \= p, then (a* : i < w) is indiscernible over Aa. 

Definition 5.4. We say that two tuples a and 6 have the same compact strong- 
type over A (or Kim-Pillay strong- type), written KP-stp(a/A) = KP-stp(6/A) 
if a-Efr holds for every bounded type-definable over A equivalence relation E. 

Therefore "equality of compact strong-types over A" is the finest type- 
definable over A bounded equivalence relation. 

We let AutiU / bdd(A)) be the set of automorphisms of U fixing all compact 
strong types over A. We say that a global type p is bdd(A)-invariant if it is 
invariant under Aut(U / bdd(A)). 

Note that Lascar strong-types refines compact strong-types refines types. 
If A = M is a model, then all those notions coincide. 

Proposition 5.5. Let p be a type over A and let R(x; y) be an A-invariant 
relation whose restriction to p(U) 2 is an equivalence relation with boundedly 
many classes, then: 

(i) there is an A-invariant bounded equivalence relation E whose restriction 
to p{U) 2 coincides with R; 

(ii) if furthermore R is type- definable, then there is a type- definable (over 
A) bounded equivalence relation E whose restriction to p{U) 2 refines R. 

Proof. Point (i) is easy: simply take E(x;y) to be (R(x;y) A p(x) A p(y)) V 
(^p(x) A -^p{y)). 

Assume now that R is type definable. Let (oj : i < a) be a family of 
representatives of each class of p(U)/R. Let E(x;y) be the relation stating 
that there are (a'^a" : i < a), all satisfying p such that R^a'^ai) A R(a",ai) 
holds for each i < a and tp(x(a^) i<Q ,/74) = tp(y(a'-) i<a /A). Then E is type- 
definable over A(ai) i<a . However, choosing another set of representatives yields 
the same relation E, therefore E is A-invariant and thus type-definable over 
A. It is clear that E is an equivalence relation. If x and y have the same 
type over A(ai) i<a , then they are ^-equivalent (take a\ = a" = dj), therefore 
E is bounded. Finally, if x and y satisfy p and are ^-equivalent, then this 
is witnessed by some (a-,af : i < a). For some i < a, we have R^x;^), 
therefore also R(y; a") holds and then R(x; y) holds. Thus E has the required 
properties. □ 
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5.2 Forking 

Definition 5.6. Let AcU. 

(i) A formula (j)(x; b) G L(U) divides over A if there is an A- indiscernible 
sequence {pi : i < u) where b = b such that the partial type 6j) : i < u} 
is inconsistent. 

(ii) A partial type ir(x) divides over A if it implies (equivalently, contains) 
a formula which divides over A; 

(iii) A partial type ir(x) fork over A if it implies a finite disjonction \/ i<n (f>i(x; U) 
of formulas, such that (f>i(x; b l ) divides over A for each % < n. 

The point of considering forking instead of dividing lies in observation 4 
below. 

Fact 5.7. 1. If p(x) is a consistent complete type over A, then it does not 
divide over A. (However, it may fork over A, see Example \5. 81 ) 

2. If the partial type tc(x) forks over A, then some finite 7To(x) C ir(x) forks 
over A. 

3. If p(x) is a global complete type, then it does not fork over A if and only 
if it does not divide over A. 

4- Let A C B and 7r(x) be a partial type over B, then tt(x) does not fork 
over A if and only if it has an extension to a global complete type which 
does not divide (equiv. fork) over A. 

5. If A C B C C and a G U, then iftp(a/C) does not divide over A, it does 
not divide over B. 

6. If p(x) is a global Lstp A -invariant type, then it does not divide ( equiv. 
fork ) over A . 

Proof. 1. If ir(x) implies a formula bo) and (pi : % < u) is an A-indiscernible 
sequence, then tt(x) implies all formulas <j)(x; bi), % < u>. As tt(x) is consistent, 
it follows that (j)(x;b ) does not divide over A. 

2. Clear by compactness. 

3. Assume that p(x) implies a finite disjunction of formulas \J i<n (fii(x',b), 
each of which divides of A. There is some finite Cq C U such that p'(x) = 
p{x)\c already implies this disjunction. Let C = Cq U A. Then we may find 
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a tuple b' G U, b' =c b. Then p'(x) implies \J i<n 4>i(x; b'), hence one of the 
formulas 4>i(x; b') is implied by p(x), and p(x) divides over A. 

4. Let 7r(x) be a partial type over A. Let be the set of formulas 
b) where 6) G L{U) divides over A. If n(x) US(s) is consistent, then 

it extends to a complete type over U which does not divide over A. Otherwise, 
by compactness, there is some finite part T>q(x) of such that ir(x) U S (x) 
is consistent. Hence tt(x) forks over A. 

5. Clear from the definition. 

6. Assume that p(x) h <p(x; b) for some <f)(x] b) G L(U), and let {pi : i < u) 
be an A-indiscernible sequence with &o — b. Then as is Lstp^-invariant, 
p(x) h 0(x; 6j) for each i < lu. Hence as p is consistent, so is the conjunction 

Example 5.8. We give an example of a type p G S(A) which forks over A. 

Let U be the usual unit circle in the plane and define a ternary relation 
R(x, y, z) on U which holds if and only if x and z are diametrically opposite, 
or y lies in the (closed) small arc between x and z. Consider the structure 
M = (U; R). Let p G Si(0) be the unique 1-type over 0. We claim that p forks 
over 0. 

To see this, let b,c,d G U divide the circle into three small arcs. Then 
p h R(b, x, c) V R(c, x, d) V R(d, x, b). It is enough to show that say R(b, x, c) 
divides over 0. We can find an indiscernible sequence (bi, Ci : % < u) such that 
(&o,Co) = (b,c) and the points 6 ) c 0)^i) c i, • • • lie i n that order on the circle. 
Then the formula R(b ,x, c ) A R(bi,x,Ci) is inconsistent. This finishes the 
proof. 

Note that this theory is NIP, since it is interpretable in RCF. 

Lemma 5.9. Let A,BcU and ir a partial type over B. Then the following 
are equivalent: 

(i) 7r does not divide over A; 

(ii) for every A-indiscernible sequence I — (q : i < ou) with cq G B, there 
is a \= 7r such that I is indiscernible over Aa; 

Proof. It is clear that (ii) implies (i). 

Conversely, assume (i) and let / = (q : i < u) with Co G B be an A- 
indiscernible sequence. Increasing I if necessary, we may assume that Co enu- 
merates B. Write ir(x) = ir(x; Cq). By the non-dividing assumption, the partial 
type {tt(x; q) : i < u} is consistent. Let a' realize it. By Ramsey, we may find 
an Aa'-indiscernible sequence (c^ : i < u) which is based on (q : % < u) over 
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Aa'. Then tp(c' /Aa') = tp(c /Aa'). Let / £ Aut(U/B) send (c- : i < to) 
to (q : % < u) and set a = f{a'). Then the sequence / is indiscernible over 
Aa. □ 

Lemma 5.10. (T any theory) Let A C B d U and a,b £ U. Assume 
that tp(a/B) does not fork over A and tp(b/Ba) does not fork over Aa, then 
tp(a, b/B) does not fork over A. 

Proof. Let M be some |.B| + -saturated model containing B. As the type tp(a/B) 
does not fork over A, it has an extension p' over M which does not fork over 
A. Let a' realize p'. There is an automorphism a fixing B and sending a to 
a'. Replacing (a, b) by (<r(a), a(b)) we may assume that tp(a/M) does not fork 
over A. Similarly, as the type tp(b/Ba) does not fork over Aa, it has an ex- 
tension q' to a type over Ma which does not fork over Aa. Replacing b by a 
realization of q', we may assume that tp(b/Ma) does not fork over Aa. 

Now it is enough to prove that tp(a, b/M) does not divide over A. So assume 
that it does divide and take <p(x, y; c) £ tp(a, b/M) and an A-indiscernible 
sequence (cj : % < to), Co = c such that {4>(x, y; q) : i < u} is inconsistent. 
By Lemma I5.9[ there is an automorphism / fixing Ac pointwise such that 
(q : i < oj) is indiscernible over f(a). Then tp(f(b)/Af(a)c) does not divide 
(indeed does not fork) over Af (a). Therefore there is an automorphism g fixing 
Af(a)c such that (/(a)Q : i < u) is indiscernible over g(f{b)). We then have 
|= (p(f(a),g(f(b)); q) for all i < u. Contradiction. □ 

Corollary 5.11. (T any theory) Assume that for every A <zlA andp £ S\{A), 
p does not fork over A, then for every A, no p £ S n (A) forks over A. 

Proposition 5.12. (T is NIP) Let AcW, and let p £ S(U) be a global type. 
Then p does not fork over A if and only if it is Lstp A -invariant. 

Proof. Right to left has already been observed and holds in any theory. 

We show left to right. Let p £ S(U) and assume that p is not Lstp^- 
invariant. This implies that there is some A-indiscernible sequence (q : i < 
u) and some formula <f)(x;y) such that p h <f)(x; c ) A -i0(x;cx). Consider 
the A-indiscernible sequence of pairs (c2iC2i+i : i < cu). The partial type 
{(f)(x; C2i A C2i+i)} is inconsistent by NIP. This implies that the formula 

4>(x; Co) A c\) divides over A. □ 

Corollary 5.13. (T is NIP) Let M -<U, andp be a global type. Then p does 
not fork over M if and only if p is M -invariant. 
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Let p G S(U) be a global type. If p is M-invariant, for some small M, then 
it does not fork over M. Conversely, if p does not fork over some small A cU, 
then it is M-invariant for any model M containing A. So for a global type p 
the properties u p is invariant over some small M C U" and u p does not fork 
over some small A C U n are equivalent. We call such types global invariant 
types. 

Corollary 5.14. (T is NIP) Let p G S(A) be a type in a finite number of 
variables. Then p has at most 2' A ' + ' T ' non-forking global extensions. 

Proof. Take M a model containing A such that \M\ — \A\ + \T\. If q is a non- 
forking extension of p, then q does not fork over M, therefore it is M-invariant. 
We have seen in the proof of Proposition 12.311 that there are only 2' A/ ' global 
M-invariant types. □ 



5.3 bdd(A)-invariance 

In this section, we assume that T is NIP. 

Proposition 5.15. Let p G S{U) be non-forking over A. Then the equiva- 
lence relation Lstp(x/v4) = Lstp(y/A) restricted to realizations of p\a is type- 
definable. 

Indeed, if c,d \= p\a, then: Lstp(c/y4) = Lstp(d/A) if and only if there is 
some sequence I such that both (c) + / and (d) + I realize p^'\a- 

Proof. It is enough to prove the second statement. 
<^=: Clear 

=>-: Let c, d realize p\a, and assume that Lstp(c/A) = Lstp(d/A). Conjugat- 
ing by an automorphism over A, we may assume that Lstp(c/A) = Lstp(ao/^4), 
for a |= p. We let / |= p^\Acda - Then a + / is a realization of p^\a- As 
p^ does not fork over A, by NIP it is Lascar-invariant over A, therefore both 
(c) + / and (d) + 1 realize p (aj) . □ 

Corollary 5.16. If p £ S(A) does not fork over A, and a,b \= p satisfy 
KP-stp(a/A) = KP-stp(6/A), then Lstp(a/A) = Lstp(6/A). 

In particular, if no type forks over its base, then compact strong types co- 
incide with strong types. (We say that T is G-compact). 
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Proof. Let p G S(A) be non-forking over A. By the previous proposition, the 
relation R(c;d) = Lstp(c/A) = Lstp(<iA4) is type-definable on realizations of 
p. By Proposition 15. 5[ there is a type-definable bounded equivalence relation 
E which refines R on p (and must therefore be equal to R by definition of 
Lascar strong types). If c, d |= p have the same KP-strong type, then they are 
-E'-equivalent and thus /^-equivalent. □ 

We now prove an analog of Proposition 12.271 for types non-forking over A. 

Proposition 5.17. Letp,q G S(U) be two global types, both non-forking over 
A and let a \= and b |= q^K Assume that Lstp(a/A) = Lstp(b/ A) , then 
p = q. 

Proof. The proof is essentially the same as that of Proposition 12. 27\ replacing 
types over A by Lascar strong types over A. 

Assume that there is some cj)(x] d) G L(U) such that p h </>(x; d) and q h 
-«j)(x]d). Pick any model M I) A U {d}. We build inductively two sequence 
(a n : n < oj) and (b n : n < to) such that for all n: 

- d n h P (u) \ Ma <n b <n - 

- b n h q {u) r Ma< n 6 <n . 

It is enough now to show that the concatenation a + b + ai + bi + . . . 
is indiscernible, because then the formula 0(x; d) alternates infinitely often on 
that sequence, contradicting NIP. 

We show by induction on n that /„ := a + bo + . . . + a n has the same 
Lascar strong type over A as a realization of g( w ( 2n+1 )). For n — 0, this is the 
hypothesis. Assume we know it for n. Then as q^ u ' is Lascar-invariant over 
A and J n has the same Lascar strong type over A as g^ 2 ™ -1 " 1 )), the sequence 
I n + b n realizes q( w ( 2n+2 )>\ A . Therefore it is an indiscernible sequence. It follows 
by Lemma r5.3l that /,„ + b n has same Lascar strong type over A as a realization 
of g( w ( 2n+2 )) , which is the same as that of a realization of p^ 2n+2 ^ . We then add 
a n+ i to this sequence by the same argument and this finishes the induction. □ 

Corollary 5.18. Let p G S{U) be non-forking over A, then p is bdd(y4)- 
invariant. 

Proof. Let a G Aut(U / bdd(A)), a \=p {uj) \ M - Then 

KP-stp(a/A) = KP-stp(a(a)/A). 

As a and a (a) are both realizations of p^'\ai which is non-forking over A, 
Corollary 15.161 implies that Lstp(a/A) = Lstp(<r(a)/A). Therefore by Proposi- 
tion EHZl p = a (p). □ 
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5.4 NTP2 and the broom lemma 

Definition 5.19. (T any theory) We say that a formula 4>(x;y) has TP 2 if 
there is an array (b\ : % < u,t < 00) of tuples of size \y\ and k < u such that: 

- for any 77 : uj — > w, the conjunction At< w ^(t)) * s consistent; 

- for any t < u, {<f>(x; bf) : i < u} is ^-inconsistent. 

We say that y) is NTP 2 if it does not have TP 2 . We say that the 
theory T is NTP 2 if all formulas are NTP 2 . 

Proposition 5.20. IfT is NIP, then it is NTP 2 . 

Proof. Assume that some formula <p(x; y) has TP 2 , as witnessed by some k < u 
and array (a* : i < lu, t < u). We show that VC(0) = 00. 

For simplicity, we assume that k = 2, and we explain afterwords how to 
handle the general case. 

Let A = {oq : t < u} and Aq C A any subset. Let 77 : u — > u such that 

v(t) = l ' iiteA 

1 otherwise. 

By the properties of the array, there is some fe^o |= At<u> Then using 

2-inconsistency, we have 

b Ao \= 4>{x] a„) t £ A . 

Thus the formula 4>(x; y) has IP. 

We indicate two ways to treat the case k > 2. One option is first to increase 
the array to (a* : i < u, t < k), by compactness. Next using standard Ramsey 
argument we may assume that the sequences {{a\ : i < uj) : t < u} are mutually 
indiscernible. Then we conclude by Proposition 14.71 Another argument is by 
counting types. Let iV be large enough and for any s < u consider the finite 
set A s = {a* : % < N, t < s}. Then the properties of the array imply that there 
are at least (N/k) s ^-types over A s . As \A S \ = N ■ s, this contradicts Sauer's 
lemma 16. 31 

□ 

The class of NTP 2 theories contains both NIP theories and simple theories. 
Most of the content of this section and the next one goes through (with some 
slight modifications) for NTP 2 theories. However, we do not try to be optimal 
here and we will assume NIP throughout. 
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Notation 5.21. a \^ A b means that tp(a/Ab) does not fork over A. 
Note that this is not in general a symmetric relation. 

Definition 5.22. An extension base is a set A C U such that no p G S(A) 
forks over A. Equivalently (using NIP), any p G 5(A) has a global extension 
which is Lstp^-invariant. 

Definition 5.23. Let / = (a^ : i < uj) be a sequence of tuples. We say that it 
is a Morley sequence over A if there is some global type p, non-forking over A 
such that I \= p^\ A - 

Lemma 5.24. Let b = (bi : i < uj) be an A-indiscernible sequence such that 
p = tp(b/A) does not fork over A. Then there is some global non-forking 
extension q(x^, X\, . . .) of p which is the type of some IA -indiscernible sequence. 

Proof. Let p' be any non- forking extension of tp(b/A). Let (di : i < uj) \= p' . 
We can find some W-indiscernible sequence (e^ : i < uj) which is based on the 
sequence (di: i < uo) over U. Let q = tp(ej : % < to). Then q does not fork over 
A and its restriction to A is p. □ 

Lemma 5.25. Let A be an extension base, and 4>(x;b) G L{U) divide over A. 
Then there is a Morley sequence (bi : i < uj) over A witnessing dividing (i.e., 
b = b and the conjunction /\ i<0J <ft(x; bi) is inconsistent). 

Proof. Let (6- : i < u) be some A-indiscernible sequence with b' = b and such 
that {4>(x; b'j)} is /c-inconsistent. Let p = tp((6-)i<w/A). By assumption on A, p 
does not fork over A, so by the previous lemma, there is some q(xo, . . .) G S{U) 
a non-forking extension of p and some r(x) G S{U) such that the type q 
restricted to any one of its variable is equal to r. Let (6* : t < u) be a Morley 
sequence of q over A, where 6* = (b\ : % < u). 

For each t < cu, the type {4>(x; bj) : i < u)} is /c-inconsistant. By NTP 2 , 
there is some r\ : uj — >■ to such that f\ t<U) b^ is consistent. As the sequence 
(fr^ f ) : t < uj) is a Morley sequence of r over A, we have what we want. □ 

Lemma 5.26 (Broom lemma). Let A C U be an extension base and n(x) be a 
partial type over U which is Lstp^ -invariant. Assume that for some formulas 
ij)(x; b) and <fti(x; c), i < n in L(U) we have 



tt(x) h ip(x; b) V V (f>i(x; c), 

i<n 
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where b c and for each i < n, <pi(x; c) divides over A. Then 

tt(x) h i]){x\ b). 

Proof. We prove the result by induction on n. The case n = is trivial. 

Assume we know it for n and let 7r(x) h ip(x; b) V Vi< n +i 4>i{ x \ c )- By the 
previous lemma, there is a Morley sequence (cj : i < u), cq — c such that 
{4>(x; Ci) : i < u} is ^-inconsistent. Conjugating by an automorphism, we may 
assume that b A\, A (ci)i <0J , thus the sequence (q)j< w is indiscernible over Ab 
(since tp(6/A(cj)) is Lstp^-invariant). 

By Lstp^-invariance of vr(x), we have 7r(x) h ij)(x\ b) V \J i<n+1 4>i{x] Cj) for 
all j < u. In particular, we have 

ir(x) h ^(x; 6) V f\ y (f>i(x;cj). 

j<k i<n+l 

By assumption on fc, this implies 



7T 



(x) h V Y &(z;c,-). 



j<k 

i<n 



For any j < k, we have 6 J^c^-, thus ^ J^ Ac c >i- Also we have c>j J^ A c i- 
Therefore by transitivity (Lemma 15. 10p bc>j A^.cj. 
By the induction hypothesis, and as 



tt(x) h 



n-^j c j. 



0<j<k 



V \J <j>i(x\Cj) 



i<n 



we have 



7r(x) h tp(x; &) V Y <M X ; c i)- 



0<j<k 
i<n 



Iterating, we obtain tt(x) h ^(x; 6). 



□ 



Remark 5.27. If in the hypothesis of the lemma, we remove the assumption 
that b X<a C) then the conclusion becomes 



7r(x) h Y ^(^; &i 



for some biS satisfying Lstp(&j/A) = Lstp(6/A). 
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Proof. Let vr(x), ip(x;b) and <f) l (x;c) be as in the hypothesis of the lemma, 
except that we do not assume b \, A c. Consider the partial type tt'(x) = 
ir(x) U b') : Lstp(6'/A) = Lstp(6/A)}. Then 7r'(x) is Lstp A -invariant 

and by hypothesis on vr(x), we have n'(x) hi^iV c). The previous 

lemma applies to give 7r'(x) \- x ^ x. Therefore ir'(x) is inconsistent and the 
conclusion follows by compactness. □ 

Corollary 5.28. Let M \= T and p G S(M). Then p has a non-forking heir 
over M. 

Proof. Assume not. Then the partial type n(x) = p{x) U {ip(x;b) : b G 
U,p h ip(x;b'),Wb' G M} U {4>(x; c) : c G U , -«j)(x; c) divides over M} is in- 
consistent. By compactness, we find a formula ip{x; b) from the first set and 
finitely many formulas <j) l (x]c), i < n from the second set such that p(x) h 
->ijj(x;b) V ~^\/ i<n (j) l (x;c). By Remark [5271 we have h V i<m ^(x;bi), 
where 6, =m 6. As p is a type over M, by compactness, there is some formula 
9(xq, . . . , x m _i) such that 

p(x) h Vy , • • • , Vm-iQ{y) -»■ V/ "^(^ 

Taking such yo, . . . ,y m -i m ^ contradicts the choice of formulas t/)(x;y). 

□ 

5.5 Strict non- forking 

Definition 5.29. Let A C B and a ElA. We say that p = tp(a/B) is strictly 
non-forking over A if there is a global extension p' of p such that p' is non- 
forking over A and such that for all C C W, if ao |= p'loB; then C ao- 

We will write a 6 to mean that tp(a/Ab) is strictly non-forking over A. 

Remark 5.30. In particular, if tp(a/B) is strictly non-forking over A, then we 
have a A^ A B and B \^ A a. It is an open question whether the converse holds 
(assuming that A is an extension base). 

Example 5.31. Let p G S(M), then p is strictly non-forking over M. This is 
witnessed by any global non- forking heir p' of p (using Corollary 15.281) . 



We now generalize this to types over arbitrary extension bases. 
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Proposition 5.32. Let A be an extension base and p £ S(A). Then p is 
strictly non-forking over A. 

Proof. Let a \= p. The proof is essentially the same as that of Corollary 15.281 
Namely, we consider the partial type p(x) U {ip(x; b) : b £ W, ^ip(a;y) divides 
over A} U c) : c £ W, c) divides over A}. If this type is inconsistent, 

then we can find some ip{x; b) from the first set and finitely many (p l (x; c), % < n 
from the second set such that 

p{x) I — <ip(x; 6)Vy c). 

i<ra 

By Remark f5. 2 71 we havep(x) h \j i<m ->ij}{x\ bi) where Lstp(6j/A) = Lstp(&A4). 
Let 6 = b * . . . A & m _i and take g £ S'(W) a non-forking extension of tp(b/A) (as 
A is an extension base). If V \= q\Aa, then by hypothesis on ip(x;y), i/j(a;bi) 
holds for all i. A contradiction. □ 

Definition 5.33. The sequence (a, : % < a) is a strict non-forking sequence 
over A, if for any i < a, we have aj a<j. 

Lemma 5.34. Let A be an extension base. Let (aj : i < a) be a strict non- 
forking sequence over A and (Jj : i < a) a sequence of A-indiscernible sequences 
such that Li begins with ai. Then we can find sequences (Jj : i < a) mutually 
indiscernible over A such that Ji =Aa t h f or each i < a. 

Proof. Without loss, the sequences Jj's are indexed by u. We may assume that 
a = n < u and we show the result by induction on n. Assume it is true for n 
and let (a« : i < n+1) and (Jj : i < n+1) be given. By the induction hypothesis, 
we may assume that the sequences {Ii : i < n} are mutually indiscernible over 
A. We have a n J^a< n . Conjugating by an automorphism fixing A and the 

sequence (a^ : i < n + 1), we may assume that a n a <re / <n . This implies: 

(1) the sequences {/« : i < n} are mutually indiscernible over Aa n ; 

(2) a <n I <n ^ A a n . 

By Lemma 153)} tp(a <n I <n /Aa n ) has an extension q over AI n such that I n is 
indiscernible over a realization of q. Therefore moving I n by an automorphism 
over Aa n , we may assume that I n is indiscernible over Aa <n I <n . 

For k < n + 1, write Ik = (af : i < uS). Let 77 : n + 1 — > uj be any function. 
As /„ is indiscernible over AI <n , we have 

%(0) ' ' ' %(n) —A %(o) " " " %(n-l) a n- 
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Then by (1) above, 

n 

a v(0) ' ' ' a v{n) =A a ' ' ' a n- 

By Ramsey's theorem, we can find sequences (I' k : k < n) mutually indis- 
cernible over AI n and based on (I* : k < n) . Write I' k = (a'* : i < u). Then by 
the previous observation we have a'fj ■ ■ -a'^ 1 =Aa n do • • -ctn-x- Therefore we 
may assume that a' = for all k < n. 

Then the sequences I' , . . . , I' n -i, In have the required properties. □ 

Proposition 5.35. Let A be an extension base, and let (a, : % < k) be a strict 
non-forking sequence over A. Let b such that dp-rk(6/A) < k. Then there is 
i < k such that tp(6/Aaj) does not divide over A. 

Proof. Assume that tp(b/Aai) divides over A for all i < k. Then, for each 
i < k we can find some 0$(a;;aj) G tp(6/Aoj) and an A-indiscernible sequence 
Ii = (dj : j < oj) such that a l = and /\j <UJ <fii( x 'i a )) is inconsistent. By 
the previous proposition, we may assume that the sequences Jj are mutually 
indiscernible over A. Then for % < k, Ii is not indiscernible over Ab, which 
contradicts the definition of dp-rank. □ 

We will show in Theorem 15.371 that forking equals dividing over extension 
bases, therefore one can replace "does not divide" by "does not fork" in the 
previous proposition. 

Proposition 5.36. Let A be an extension base and (a, : % < u) indiscernible 
and strict non-forking over A. Assume that the formula 4>(x;ao) divides over 
A, then the partial type {<f)(x; ai) : i < u} is inconsistent. 

We say the strict non-forking sequences witness dividing. 

Proof. Assume that {<p(x; a^) : i < u} is consistent. Then we may increase the 
sequence to some A- indiscernible (a^ : % < \T\ + ), strict non-forking over A such 
that there is b \= /\ i<: , T , + <j>(x; aA . Then tp(6/Aij) divides over dj for each i, 
and this contradicts Proposition 15.351 □ 

Theorem 5.37. Let A be an extension base and <f)(x; a) a formula, then <fr(x; a) 
forks over A if and only if it divides over A. 



5.5. STRICT NON-FORKING 



67 



Proof. If 4>{x; a) divides over A, then it forks over A. Conversely, assume that 
4>(x;a) forks over A. Then there are formulas t/)i(x;c) G L(U), i < n, such 
that a) h \f i<n ipi(x; c) and each -0j(x;c) divides over A. Let g G 5"(W) 
be a strict non-forking extension of tp(ac/A) (using Proposition 15.32) ) and let 
(ajQ : i < oj) be a Morley sequence of g over A. If a) does not divide over 
A, then there is b \= /\ i<UJ <p(x; a^). Then trimming the sequence if necessary, we 
may assume without loss that b \= /\ i<UJ ipo(x\ q). This contradicts Proposition 

EM □ 
Lowness 

Let A be any set of parameters and let <p(x; y) be a formula over A. Let k be 
the alternation number of <p(x\ y), that is the maximal k for which we can find 
an A-indiscernible sequence {pi : i < u) such that f\ i<k ->{4>{x] bi) <r> b i+ i)) 
is consistent. Let now (pi : i < oj) be any A-indiscernible sequence. Assume 
that f\ i<k+1 4>(x; bi) is consistent and we show that f\ i<UJ 4>(x; bi) is consistent. 

To see this, increase the sequence to an A-indiscernible (bi : i < u ■ (k + 1)). 
By indiscernability, f\ i<k+1 4>(x; b^.i) is consistent and let a realize it. As k is the 
alternation number, there is some i < k+1 such that a realizes f\j <UJ <P(x] b^-i+j), 
otherwise the truth value of 0(a; bi) would alternate too much on the sequence. 
Thus again by indiscernability, /\ i<UJ <f>(x; bi) is consistent. 

Proposition 5.38. Let A be an extension base and(p(x;y) a formula over A. 
Then the set ofb's such that 4>(x;b) forks over A is type-definable over A. 

Proof. By the paragraph before the proof (and Theorem 15.371) . we know that 
4>(x; b) forks over A, if and only if it divides over A, if and only if there is some 
A-indiscernible sequence (bi : i < u) such that: 

- b = b; 

- Ai<ait(<«+i <K x ; b i) is inconsistent. 

This is easily seen to be a type-definable condition on b. □ 

Theories satisfying the conclusion of the previous proposition are called low 
theories. Thus NIP theories are low. 

Historical Remarks 

The definition and general properties of forking and dividing are due to Shelah. 
See for example his book [81]. Properties of forking specific to NIP theories 
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presented in Section IST21 are proved in Shelah's jH3] and Hrushovski and Pillay's 
[17] . The proof that forking equals dividing is from Chernikov and Kaplan's 
[2Tj . We follow closely the simplified account given by Adler in pQ. The 
definition and properties of strict non-forking appeared in [83] with slightly 
erroneous statements and proofs. It is investigated in [SI]. 



Chapter 6 

Finite combinatorics 



In this chapter, we introduce tools from finite combinatorics and probability 
theory. 

It was first observed by Laskowski in [57] that the notion of NIP formula 
had an analogue in combinatorics under the name finite VC- dimension. More 
precisely, a formula 4>{x\ y) is NIP if and only if the class {0(M; b) : b G M} 
of subsets of M' x \ has finite VC-dimension. This is a central notion in ma- 
chine learning theory, as classes of finite VC-dimension coincide with learnable 
classes. See [30J. This theory builds on the theorem of Vapnik and Chervo- 
nenkis (from which the name VC originates) which states a uniform law of 
large numbers for such classes. 

We give a proof of that theorem and of the so-called (p, ^-theorem of Alon- 
Kleitman and Matousek. The later could be restated in model-theoretic terms, 
but no proof is known using the classical tools of first order logic. We will there- 
fore change the framework in this chapter and consider a purely combinatorial 
setting. We will come back to first order structures at the end to prove unifor- 
mity of honest definitions. 

6.1 VC-dimension 

We consider the following situation: X is a set (finite or infinite), and S is a 
family of subsets of X. Such a pair (X,S) is called a set system. For most 
purposes, we can forget about the base set X, or in other words, take X to be 
US. 

Let A C X. We say that the family S shatters A , if for every A C A 0l 



69 



70 



CHAPTER 6. FINITE COMBINATORICS 



there is a set S in S such that S D A = A. 

The family S has VC-dimension at most n (written VC(<S) < n), if there 
is no A C X of cardinality n + 1 such that 5 shatters A . We say that 5 is of 
VC-dimension n if it is of VC-dimension at most n and shatters some subset 
of size n. 

If for each n we can find a subset of X of cardinality n shattered by S, then 
we say that S has infinite VC-dimension (and write VC(«S) = oo). 

For a more careful analysis, we define the shatter function its from N to 
N as follows: irs(n) is the maximum over all A C X of cardinality n of 
|{C nA :C£ S}\. Note that 7r 5 is bounded by 2 n , and we have: 

VC(5) < n <=> 7r s (n) < T. 

Given a set system (X, S) , we define the dual set system as the set system 
(X*,S*), where X* = S and S* = {S a : a 6 X}, with 5 a = {5 e 5 : a 6 S}. 
We then define the VC codimension of 5 (written VC*(«S)) as the VC-dimension 
of S* . We also define the dual shatter function 7r£ as the shatter function of 
S*. 

Example 6.1. Let <f)(x;y) be a formula and fix a model M. Then <j)(x;y) 
is NIP if and only if the class S^M) = {<j>(M;b) : b £ M} is of finite VC- 
dimension. The VC-dimension of <p(x; y) as defined in Section [?!T1 coincides with 
VC-dimension of that class. In particular, it is independent of the choice of M. 
We similarly define tt^ and the dual objects. Note that the VC codimension of 
<ft(x; y) is the VC-dimension of 4>*(y; x) where (p*(y; x) = <f)(x; y). 

Lemma 6.2. We have VC*(<S) < 2 VC ^ +1 and VC(5) < 2 VC *( 5 ) +1 . 

In particular S has finite VC-dimension, if and only if it has finite VC- 
codimension. 

Proof. Assume VC(«S) > 2 n . Then there is some subset A C X of size 2 n 
shattered by S. Write A = {ac '■ C C n}. For each k < n, let Sk G S be such 
that Sfc fl A = {a c : {A;} C C C n}. Then easily, the family {Sk : k < n} is 
shattered by S*. It follows that VC* (S) > n. This proves the second inequality. 
The first one is proved similarly. □ 

The following lemma states that the shatter function 7rs(n) is either always 
equal to 2™, or has polynomial growth. 
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Lemma 6.3 (Sauer's lemma). Let S be a class of VC- dimension at most k. 
Then, for n > k, we have irs(n) < Y^k=o ■ 
In particular 7is(n) = 0(n k ). 

Proof. First notice that the bound is tight: take S to be the family of subsets 
of X of cardinality k. Then S has VC-dimension exactly k and we see that its 
shatter function is equal to the bound in the statement of the lemma. The idea 
of the proof is to reduce the situation to this case by modifying the elements 
of S, making them as small as possible without changing the cardinality of the 
family nor its VC-dimension. 

Fix an integer n > k. If 5 contradicts the bound, then this is also true of 
some finite subfamily of S, so without loss we may assume that S is finite. Sim- 
ilarly, we may assume that X is finite of cardinality n. Say X = {x\, . . . , x n }. 

We define iteratively families S , . . . ,S n . Set S = S. 

Let k < n, and assume that Sk has been defined. Go through the sets in 
iSfc one by one. For each S G Sk, if Xk G S, and if S\ {xk} is not a set in Sk, 
replace S by S \ {xk}. Let Sk+i be the resulting family. 

The following three facts can be easily checked by induction on k: 

(i) for each k, the cardinality of Sk is the same as that of S; 

(ii) let S G Sk and A — S fl {x±, ...,Xk}', then for every A C A, the set 
A U (S \ A) is in Sk', 

(iii) any A <0 X shattered by Sk is also shattered by Sk-i- 

Fact (ii) implies in particular that if S G S n , then 5* is shattered by S n . 
From (iii), it follows that the VC-dimension of S n is not greater than that of 
S. Therefore no set in S n can have cardinality greater than k. As |<S„| = |<S|, 
we obtain the required bound. □ 

We now define the VC density of S to be vc(5) = limsup nH>00 log ^^ ■ In 
other words, vc(S) is the smallest r > for which we have 7Ts(n) = 0(n r ). 
Similarly, we define the VC-codensity, as vc*(S) = vc(S*). 

By the previous result, we always have vc(5) < VC(5). For many purposes, 
the VC density is a more appropriate notion than the VC-dimension. 

We now state and prove the fundamental theorem of Vapnik and Chervo- 
nenkis. It is a uniform version of the law of large numbers for set systems of 
finite VC-dimension. 

First, we fix a notation. For S G S and (x\, . . . ,x n ) G X n , we define 
Av(xi, . . . ,x n ; S) as being equal to -1,9 fl {x\, . . . , x n }\. It is the measure 
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of S estimated on the finite set {x±, . . . ,x n }. The weak law of large num- 
bers (Proposition IC.3I) states that for fixed S G <S, and e > 0, we have 
/i n (| Av(xi, . . . ,x n ; S) — fJ,(S)\ > e) — > 0. It follows from its proof, that the 
speed of convergence is bounded by a function depending only on e. In other 
words, with high probability, sampling on a tuple (x±, . . . , x n ) selected at ran- 
dom gives a good estimate of the measure of 5". The VC-theorem states that 
if S is of finite VC-dimension, then sampling on a random tuple (xi, . . . ,x n ) 
gives a good estimate of the measures of all the sets in S. 

Theorem 6.4 (VC-theorem). Let (X,fi) be a finite probability space, and S C 
*$(X) a family of subsets, then for e > we have: 

ne 2 



j/ 1 I sup \Av(xi, . . . ,x n ; S) — fi(S)\ > e I < 4irs(2n) exp 

Proof. For x = (x±, . . . , x n ), x' = (x[, . . . , x' n ) and S G S, we let 5(x, x'\ S) 
be equal to | Av(xi, . . . , x n ; S) — Av(x' 1 , . . . , x' n ; S)\. Let also 5i(x,x';S) = 
±{l s {xi) - l s (xd). Thus 6{x,x';S) = | E i=x .. n S t (x, x';S)\. 

Claim : We have: 

/i 2n (sup5(x,x'; S) > e/2] < 27r 5 (2n) exp ( 

Proof: Fix some integer n and some tuples x, x' G X n . Fix also some S G S. 
For % < n, we let Z 2 act on (xj, £•) by flipping the two variables, and we let (Z2)™ 
act on x"x' by the product action. Let fls — {i G {1, . . . , n) : 5i(x, x'; S) ^ 0}. 

Let Y be the probability space (Z 2 ) n equipped with the normalized counting 
measure. We consider the random variable / : Y — > R defined by f(a) = 
S(a(x, x')\ S) and for each i, we let fi be the random variable defined by fi(cr) = 
Si(a(x, x'); S). Then f = \ J2 M- I* 1 ^ ac ^ f° r * ^ /i i s uniformly equal to 
zero. Thus we have / = | Y. i& n s fi\- 

For i G Og, the random variable /, satisfies Prob(/j = —l/n) = Prob(/j = 
1/n) = 1/2. Also, the variables /, are independent. Let As C Y be the set of 
a G Y for which f{o~) > e/2. By Chernoff's bound (Proposition IC.4) ). we have 



\A S \ < 2l^l +1 exp ( JIhlf) < 2™ +1 exp ( ^ 



The set As depends only on S(l{xi, . . . , x n , x[, . . . , x' n }. As S varies in S, there 
are at most irs(2n) values for the sets As- Thus A* = {J S £S As has measure at 
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most a := 2irs(2n) exp(— ne 2 /8). For a A*, we have sup Se5 S(a(x, x'); S) < 
e/2. 

Consider the space X 2n x Y equipped with the product measure. Define 
D C X 2n x Y by (x,x',cx) e D if sup 5e5 5(cr(x, x'); 5) > e/2. Then for fixed 
x,x' G X 2 ™, the fiber Df^,x') ^= Y has measure at most a. Therefore the 
measure of D is at most a. On the other hand, given a, a' G Y, the fibers 
D^, D^) have the same measure because the measure on X 2n is symmetric. 
Therefore that measure is at most a, and the claim follows. 

Now, we may assume that n > 2/e 2 , since otherwise the inequality is trivial. 
Let X C X n be the set of a G X n such that /i n (sup 5&s a; 5) > e/2) > 1 /2. 

We have /i n (X ) < 47r 5 (2n) exp (~sf V Pick a G X n \ X and some S E S. 

By Chebyshev's inequality, we have 

Prob(| Av(xi,...,x n ;S') -/x(S r )| > e/2) < l/(ne 2 ) < 1/2, 

(see the proof of Proposition \C3\i . It follows that there is x G X n satisfying 
both: 

- 5(a,x; S) < e/2; 

- | Av(x; S) - fi(S)\ < e/2. 

Remembering the definition of 5, this implies that | Av(a; S) — fi(S)\ < e. 
This finishes the proof. 

□ 

Remark 6.5. In fact the theorem is still true if X is infinite, but we have to 
add the following measurability assumptions: 

- each set S G S is measurable; 

- for each n, the function X n — > R, (x±, . . . , x n ) \-> sup 5€iS | Av(xi, . . . , x n ) — 
n(S)\ is measurable; 

- for each n, the function X 2n — > K, (xi, . . . , x n , . . . , x' n ) sup SeS S(x, x'; S) 
is measurable. 

The first condition implies the other two when the family S is countable, 
and of course they always hold when X is finite. The proof then goes through. 

To see how the second and third hypothesis might fail, consider the case of 
X = oj\. Let B be the a- algebra generated by the intervals. Let \x be defined 
on B by fJ,(A) = 1 if A contains an end segment of X and [i(A) = otherwise. 
Note that this defines a er-additive measure on (X, B). Let S to be the family 
of intervals of A. It has VC-dimension 2. We leave it to the reader to check 
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that the VC-theorem does not hold for S. (In view of Corollary 16. 6l it is enough 
to check that there are no e-nets, for e < 1). 

If (X, S) is a set system, an e-net for S is a finite subset Xq = {x\, . . . , x n } 
of X such that for all S 6 S, we have 



-is n {xi, . . . ,x n }| - /i(S') 

n 



< e. 



Corollary 6.6. Let > and e > 0, then there is N such that any set system 
S on a finite probability space (X,fi) with VC(S) < k admits an e-net of size 
at most N. 

Proof. By Sauer's lemma, we know that there is some polynomial Pk depending 
only on k such that ir s (n) < P k (n). Take N such that AP k (2N) exp (-^r) < 1 
and apply the VC-theorem 16.41 □ 



6.2 The (p, g)-theorem 

Let p > q be two integers. A set system (X, S) has the (p, g)-property if for 
every X\ C X of cardinality > p, there is S* G S such that \Xx fl SI > q. 

The following theorem will be an important ingredient in the proof of uni- 
formity for honest definitions. 

Theorem 6.7. Let p > q be two integers. Then there is an integer N such 
that the following holds: 

For any set system (X,S), assume: 

- X = US is finite; 

- VC(S) < q; 

- (X,S) has the (p,q) -property. 

Then we can find S±, . . . , Sn £ S such that [j i=l N Si = X . 

Actually, we will only need a special case of this theorem with p = q big 
enough with respect to VC(S). We state this as a separate corollary. We will 
only prove this corollary (without using the theorem) and refer the reader to 
the references for the proof of the full theorem. 



6.2. THE (P,Q) -THEOREM 
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Corollary 6.8. Let k G N, i/ien t/iere are £u>o integers q and N such that for 
every finite X and S C ?fi(X) a familiy of VC-dimension at most k, if for 
every Xq C X of size < q, we can find S G S containing Xq, then there are 
Si, . . . , Sn £<S whose union is the whole of X . 

Proof. (Of the corollary) 
Let e = 1/3. 

By Corollary 16. 6[ there is some q such that for every set system S of VC- 
dimension <l;ona finite set X and any probability measure fi on X, there 
are Xi, . . . , x q G X such that for any S G S, 



Let (X, S) be a set system of VC-dimension < k with X finite and having 
the (q, g)-property. Then it follows that for any probability measure \i on X, 
we can find some S G S with fi(S) > 1 — e. 

We now need the following result, known as Farkas' lemma. We refer the 
reader to any introductory text on convex analysis or linear programing for a 
proof. See for example [TSJ page 90]. 

Fact 6.9 (Farkas' lemma). Let A be a matrix in Ai m ,n(M) an d b G M. n , then 
the following are equivalent: 

(i) 3x G such that Ax < b; 

(ii) for all y G A^i, m (R+), yA > implies yb > 0. 

Write X = {xi, . . . ,x m -i} and S = {Si, . . . , S n }. We define a matrix 
A G M. n ,m by setting = 1 for all j < n, and for i < m — 1 and j < n, we 
set Ai j = —1 if Xi G and otherwise. We also define b G M m by foj = — (1 — e) 
for i < m — 1 and 6 m = 1. 

We check that condition (ii) of Farkas' lemma is satisfied. So let y = 
(cii, . . . , a m _i, a m ) G A / li im (]R + ) be such that yA > 0. This means that for any 
j < n, we have Ya=i a i A ij - °- Hence, for any j < n, V ; , :r ; r ; a, < a m . 

Let a* = X/i<m-i a «- -^ e ^ A* ^ e ^ ne measure on ^ defined by giving weight 
a^/a* to the point xi, for all % < m — 1. Then the previous inequality implies 
that no set Sj has measure greater than a m /a*. On the other hand, by what 
we have shown above, there is some 5e5 such that jl(S) > (1 — e). It follows 
that a m /a* > 1 — e. And this gives exactly yb > 0. 

By Farkas' lemma, we conclude that (i) above holds, namely that there is 
some d = (di, . . . ,d n ) T , with each dj > such that Ad < b. Decoding, this 



\{i:XieS}\ 



< e. 



n 
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gives that for i < m — 1, J2j. x . eSj dj >1 — e and Y^=\ dj < 1- Increasing do if 
necessary, we may assume that actually Y^j=i dj — 1- 

We now consider the dual set system: to i G X, we associate E x = {S E 
5:iGS}. Let £ = {E x : x E X}. Then VC(£) < 2 k+1 . Equip the set S by 
the measure /i* defined by the weights di. By Corollary 16.61 again, we can find 
N depending only on k and e and S[, . . . , S'i E S such that for all x E X, 



H*{E X ) - 



\{l:xES[}\ 



N 



< e. 



For any x E X, we have /i*(E x ) = J2\j X es } dj > 1 — e > e, thus |{Z : x E 



S[}\ > 0. It follows that lXi S{ = X 



□ 



6.3 Uniformity of honest definitions 

We come back to the model theoretic context. We assume that our ambiant 
theory T is NIP. 

Recall from Theorem 13.11 (honest definitions) and the remark following it, 
that given M \= T, A C M and <j)(x; b) E L(M), there is some ip{x; z) E L such 
that for any finite Aq C (f)(A; b) we can find d E A with Aq C ip(A; d) C 6). 

We now address the question of uniformity of ip with respect to 0. First, 
compactness gives a weak uniformity statement. 

Lemma 6.10. Let <j)(x;y) E L. For any formula ip(x;z) (where x is the same 
variable as in <fi and z may vary), let an integer be given. Then there are 
finitely many formulas ip , . . . , ip n -i such that: 

for M \= T , A C M , b E M , there exists j < n, such that for any Aq C 
4>(A; b) of size < there is some d E A with A C if}j(A; d) C (f>(A; b). 

Proof. Consider the language V = L U {P(x); Cb}, where q, is a new constant. 
Consider the //-theory T' axiomatized by T along with the formulas G^,, for 
i>{x; z) E L, where 

9^ = (3x , • • • , X^-i E P) f\ <j)(Xi\ Cb) A ~^3d E P 

Ai/j(xi] d) A Vx E Pi/j(x; d) ->■ c b ). 
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By Theorem 13.11 as recalled above, the theory T' is inconsistent. Therefore 
by compactness, there are finitely many formulas ipo, . . . ,ip n -i such that T U 
{6^ , . . . , 6^, n _ 1 } is inconsistent. Then we have what we want. □ 

Theorem 6.11. Let 4>(x;y) G L. There exists ?p(x; z) G L such that for any 
M \=T, A C M of size > 1, b G M and A C <p(A; b) finite, there is d G A 
with 

A Cip(A; d)C0(A; b). 

Proof. For any formula ip(x;z), let be the VC-dimension of ip and let 
(n^jNty)) be given by Corollary 16.81 for k = k^. Apply the previous lemma 
to those n^s. It gives us formulas ifto, . . . , ip n -\- For i < n, let 

tyi(x;y y N(i>i) ) = \J ifji(x;zj). 

i=i 

Let now M \= T, A C M and b G M. By the previous lemma, there is some 
i < n such that for any A C <p(A; b) of size < n^., we can find d E A with 
6)C^(A;6). 
Let A a C b) be finite. Let 

5 = d) : ^(A; d) C d)} C ^(A,). 

Then the set system S has VC-dimension less than or equal to VC(^j). Fur- 
thermore, the assumptions on ipi imply that S has the (n^ v n^.)-property. 
Therefore by Corollary I6.8[ there are S±, . . . , Sn^) which cover the whole of 
A x . Write Sj = ^{Ai, dj) . Then 

A 1 C di, . . . , djv(^)) C 0(A;6). 

If |A| > 2, then by usual coding tricks, we can replace the finite set 
{^0; • • • j ^n-i} by a single formula, and the theorem follows. □ 

The following corollary is usually referred to as "uniform definability of 
types over finite sets", or UDTFS. 

Corollary 6.12 (UDTFS). Let 4>(x;y) G L, then there is ip(x;z) G L such 
that for any b G U and A dU a finite set, there is d G A with 

( p(A;b) = ^(A-d). 
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Proof. Simply apply Theorem 16.111 with A = An. □ 

Note that this implies that the number of 0-types over A is bounded by 
|^4|' Z L One can see this as a model theoretic version of Sauer's lemma I6TB1 which 
says that the number of 0-types is polynomial in the size of A. However we 
here have no a priori bound on \z\. 

Remark 6.13. We have used the fact that the theory T is NIP to obtain an 
honest definition for <j)(x; b) and again to apply Corollary 16.81 to the formulas 
ip(x; z). In particular, the proof does not go through assuming that only (f>(x; y) 
is NIP. It is an open question whether or not UDTFS holds in that case. 

Historical Remarks 

VC-dimension was introduced in the context of statistical learning theory by 
Vapnik and Chervonenkis in [93] where Theorem 16.41 is proved. It has played 
a major role in that area. Sauer's lemma 16.31 is implicit in [93], and was 
rediscovered independently by at least two authors: Shelah [79] and Sauer [73] . 
Shelah and Sauer's proofs are by induction on n. The proof we give was found 
independently by Alon [7] and Frankl [31~j . 

The (p, q) theorem was proved for families of convex subsets of the euclidian 
space by Alon and Kleitman [3] and adapted to families of finite VC-dimension 
by Matousek [62]. The udtfs property was conjectured by Laskowski and proved 
for o- minimal theories by him and Johnson in [31]. Then Guingona proved it 
for dp-minimal theories [33J. It was proved for all NIP theories in [22]. This 
problem is linked to a similar question in machine learning theory (existence of 
compression schemes for learnable families). See the discussion and references 
in [3E]. 



Chapter 7 
Measures 



In this chapter we study Keisler measures which can be seen as a generalization 
of types which allow for truth values in the segment [0, 1], or as ordinary 
probability measures on the compact space of types. The two points of view 
are useful and we will often switch from one to the other. 

Many properties of types generalize naturally to measures. In particular, 
we will define generically stable measures. As we will see, those are ubiquitous 
in NIP theories, as opposed to generically stable types. One way to obtain such 
a measure is to take a a-additive propability measure on a standard model (for 
example the Lebesgue measure on [0, 1] seen as subset of the structure M). 

The main use of measures seems to be as invariant measures on group, but 
we will not explain that here and refer the reader to [17] and [12] for more 
information. 

In this chapter, we again assume that T is NIP, although the basic defini- 
tions are of course valid in any theory. 

7.1 Definitions and basic properties 

If A is a set of parameters, and x a variable, we let C X (A) denote the algebra of 
A-definable sets in the variable x. Equivalently, it is the boolean algebra of for- 
mulas with parameters in A and free variable x, quotiented by the equivalence 
relation ~ ip(x) •<==>■ U \= <f)(x) ijj{x). 

Definition 7.1. Let A C M be a set of parameters. A Keisler measure (or 
simply a measure) \x over A in the variable x is a finitely additive probability 
measure on C X (A). In other words it is a function /x : C X (A) — > [0, 1] such that: 
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n{x = x) — 1; 
fi(^<f>(x)) = 1 - n{(j){x))\ 

A ifj(x)) + V ^(x)) = /i(0(x)) + /jl((i/j(x)). 

We will sometimes write /i as [i x or /i(a;) to emphasize that fi is a measure 
on the variable re. If A C 5, and /i G 971^.(1?), we define the restriction of fi to 
A denoted or f A as the restriction of to As for types, we say 

that /i is an extension to B of /j,\a- 

Example 7.2. • If p G ^(A) is a type over A, then p will be identified 
with the Keisler measure n P (x) over A defined by n P ((j)(x)) = 1 when 
p h Thus a (complete) type is a special case of a Keisler measure. 

We will often identify the type p with the associated measure, and write 
for example p((fi(x; b)) which stands for /i p (0(x; b)). 

• Given a , a\, . . . in [0, 1] such that ^0^ = 1, and types Po,Pi, ■ ■ ■ over A 
in the same variable x, we can define the average measure /i = Y2 a iPi- 

• Take T to be the theory of real closed fields and let 1R be the standard 
model. Let no be any Borel probability measure on IR. Then /^o induces a 
Keisler measure over U in one variable x defined by = fio(<f)(M>)) . 

• Let I = (flj : i G [0, 1]) be an indiscernible sequence. Let A denote the 
usual Lebesgue measure on the interval [0, 1]. We can define the average 
measure Av(I) as the measure /x defined by n(4>(x; b)) = Xo({i G [0, 1] :|= 
4>(ai]b)}). Note that NIP ensures that the set in question is Lebesgue 
measurable (it is in fact a finite union of intervals). 

• Let (M n : n < u) be a sequence of finite structures. For each n < u, 
let fi n denote the counting measure on M n . Let T> be an ultrafilter on 
u and consider the ultraproduct M = [j^M,,. We define a measure 
fi on M in the following way. Let <j)(x; b) G L(M) be a formula. Let 
{b n : n < uj) be a representative of b in Y\ n<UJ M n . Then set fi(4>(x; b)) = 
limp yU n (0(M n ; b n )). 

• Here is an example outside of NIP that can serve as counter-example to 
most of the results in this section which require NIP. Let T be the theory 
of the random graph in the language {R}- Let M \= T. Define a Keisler 
measure /i on Mby/i (Ci^xRai)^) = 2~ n , where the Oj's are pairwise 
distinct points of M and i] : n — > {0, 1} is any function. 
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Let DJl x (A) denote the set of Keisler measures over A. It is a closed subset 
of [0, l] £a; ( A ) ; equipped with the product topology. We equip 9Jt x (A) with the 
induced topology, making it a compact space. The identification of a type with 
the measure it defines gives an identification of S X (A) as a closed subspace of 
Wl x (A). 

Borel measures 

Let /j, G $Jl x (A) be a Keisler measure. It assigns a measure to every clopen 
set of the space S X (A). We show how to extend that measure to a a- additive 
Borel probability measure. First, if O C S X (A) is open, we define fi(0) = 
sup{fi(D) : D C 0,D clopen}. And similarly for closed sets. Note that if X 
is either closed or open, then we have 

(Reg) sup{/i(F) : F C X, F closed} = mi{/i(0) :XCO,0 open}. 

We show that the set of subsets X C S X (A) satisfing (Reg) is closed under 
complement and countable union. Complement is clear. For countable union: 
let X = y} i<UJ Xi and fix e > 0. For each % < u, take Fj C I f C Oj with 
\x(Oi \ Fi) < e2~\ Let O = U i<UJ Oi- Then we can find some finite N such 
that y,{q \ {Ji<N°i) < e - Let F = lJi<N F i- Then we have F C X C O and 
fi(0 \F)< 2e. 

It follows that every Borel subset of S X (A) satisfies (Reg). We can therefore 
define fi on all such sets by n(X) = sup{/i(F) : F C X,F closed} = inf{//(0) : 
X C 0,0 open}. It is easy to check that this defines a a- additive measure on 
S X (A). Property (Reg) is refered to as regularity of the measure \i. 

So, to a Keisler measure on A, we have associated a regular probability 
measure on S X (A). Conversely, if /i is a regular probability measure on S X (A), 
then it defines a Keisler measure by restriction to the clopens. Regularity 
ensures that fi is entirely determined by that restriction. We therefore obtain 
a bijection 

{Keisler measures on A} i — > {Regular Borel probability measures on S X (A)}. 

We will often use that bijection implicitly, using the same notation for the 
Keisler measure and for the Borel measure on the type space. In particular, if 
fi is a Keisler measure on A, we may write fi(X), where X is a Borel subset of 
S X (A). 
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We define the support of fi as the set C S X (A) of types p G S X (A) such 
that p((p(x;b)) = 1 ==>■ fi(<f)(x;b)) > for any 4>(x;b) G The support 

of /i is thus a closed set of S X (A). A type in the support of \i is called weakly 
random for //. 

If X is a Borel set of positive /i-measure, we define the localization of \x at 
X to be the measure Hx{ x ) defined by fi X ((fi(x)) = /i(0(x) D X)//i(X). 

Extending measures 

The next lemma says that any partial measure in a suitable sense extends to 
a full Keisler measure. 

Lemma 7.3. Let Q C C X (A) be a set of (equivalence classes of) formulas 
closed under intersection, union and complement and containing T . Let Ho be 
a finitely additive measure on Q with values in [0, 1] such that ;Uo(T) = 1. Then 
fi extends to a Keisler measure over A. 

Proof. By compactness in the space [0, l]^^), it is enough to show that given 
ipi(x), . . . , ipn(x) formulas in jC x (A), there is a function / = (ipi, . . . ,ip n ) — > 
[0, 1] finitely additive and compatible with fiQ (where (B) denotes the boolean 
algebra generated by B). We may assume that ipi, . . . , ip n are the atoms of the 
boolean algebra B that they generate. 

The elements of Q in B form a sub-boolean algebra. Let 0i, . . . , m be its 
atoms. We have say: 

01 = ^1(1) V--- V^tiflO ^2 = ^ia(l) V " ' " V ^i2(b) etc - 

Then any finitely additive / satisfying /Ou(i)) H h f(i>h(h)) = A*o(0i) etc. 

will do. □ 

Lemma 7.4. Lei /J G SOt^M) &e a measure and let <f>(x;b) G L{U). Let 
7*1 = sup{/x(/?/>(x)) : ?/>(x) G L(M), \= if}(x) —> 0(x;o)} and r 2 = inf{/x(/?/>(x)) : 
?/>(x) G L(M), |= 0(x; 6) -»■ ?/>(x)}. 

Taen /or an?/ 7*i < r < r 2 , there is an extension v G VJt(U) of \i such that 
v((j)(x] b)) = r. 

Proof. It is enough to find some z/ 1; v 2 such that z/j(0(x; 6)) = 7*3 for i G {1, 2}, 
since we can then consider averages of and z/2. We build v 2 - Let C C X {U) 
be the boolean algebra generated by L(M) and the formula <f>(x; b). By Lemma 
17.31 it is enough to define v 2 on Q. 



7.2. BOUNDEDNESS PROPERTIES 
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First assume that r 2 = 0. Then for any Q(x) G L(M), set V2(9(x)) = fi(9(x)) 
and set U2(4>(x;b)) = 0. This extends uniquely to a finitely additive measure 
on Q. Thus we are done. A similar argument works for r 2 = 1. 

If r 2 > 0, then let p! be the localization of /1 on the closed set C = f\9(x) 
where 9(x) ranges over formulas of L(M) such that |= <f){x;b) — > 9{x). Let 
fi" be the localization of /1 on the complementary open set. Then we have 
[i = r2/i' + (1 — r2)/i". By the previous paragraph, we can extend fi' and /j," 
respectively to v' and v" such that v' (<j)(x; b)) = 1 and u" (<f>(x; b)) = 0. Then 
set v 2 = r 2 z/ + (1 — r 2 )zA 

The construction of V\ follows by taking -«f>(x; b) instead of <p(x; b). □ 

7.2 Boundedness properties 

Lemma 7.5. Let fi G Wl x (M) be a mesure and (pi : % < u) an indiscernible 
sequence in M. Let (j)(x;y) be a formula and r > such that n(<f>(x;bi)) > r 
for all i < uj. Then the partial type {<f)(x; bj) : i < u} is consistent. 

Proof. First we show that we can assume that the sequence (pi) is /i-indiscernible, 
by which we mean that if %\ < • • • < i n < oj and ji < ■ ■ ■ < j n < then 
b h ) A • • • A (f)(x; b in )) = p((j)(x; b h ),..., (j)(x; b jn )). 
Consider the set of conditions on a measure v G 9Jt x (M): 

- bi)) > r, for all i < u; 

- \u(cj)(x;b h ) A--A<p(x;b in ))-u(<i>(x;bj 1 ),...,<f>(x;b jn ))\ < 1/N, for every 
integer iV and any i\ < ■ ■ ■ <i n < uj and ji < ■ ■ ■ < j n < UJ - 

Note that each one of these conditions defines a closed subset of 9Jl x (M). 
By Ramsey and the existence of /1, we see that any finite set of such conditions 
is consistent. Therefore by compactness in the space $R X (M), the whole set is 
consistent, realized by some v. Replacing fi by u, we have what we want. 

Assume for a contradiction that {<fr(x; bi) : % < u>} is inconsistent. Then 
there is some such that fi{<p{x; bo) A • • • A &at-i)) = 0. Take a minimal 
such A^ and let N' = N — 1. For any integer m, let ip m {x) = <fi(x;b m N') A 
■ ■ ■ A (f)(x;b m N'+N'-i)- Then by minimality of A^ and indiscernability of the 
sequence, there is some t > such that we have p(ip m (x)) = t for all m. Also 
by the property of N, we have fi(ip m (x) A ip m '(x)) = for m 7^ m' . 

But then we have fi(ipo(x) V- • • V^ m _i(x)) = mt, for all m. This contradicts 
the fact that the measure of the whole space is 1. □ 
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Lemma 7.6. Let fi G D3l x (M). We cannot find a sequence (pi'.i < u) of tuples 
of M, a formula (fi(x;y) and e > such that fi((j)(x;bi)A(j)(x;bj)) > e for all 
i,j <u, j. 

Proof. Assume otherwise. Then by Ramsey and compactness, we may assume 
that the sequence (pi : i < uf) is indiscernible. By NIP, the partial type 
{(f)(x] &2fc) A0(x; &2fc+i) : k < u} is inconsistent. This contradicts the previous 
lemma. □ 

Let fi G Wl x (M). If X(x) and Y(x) are two M-definable sets, set X ~^ Y if 
n(XAY) = 0. Then the previous lemma implies that the set of ~ M -equivalence 
classes has bounded cardinality. In particular, the support S(fi) of /i has 
cardinality bounded independently of M. 



7.3 Smooth measures 

We now define and investigate the notion of a smooth measure, which can be 
considered as an analog of realized types. 

Definition 7.7. Let fi G 9Jt x (M). We say that \x is smooth if for every iV D M, 
H has a unique extension to an element of Wl x (N). 

More generally, if [i G 9Jt x (A^) and M C N, we say that \x is smooth over 
M if y\u is smooth. 

Lemma 7.8. Let \l G Wl x (M) be a smooth measure. Let (f)(x, y) G L and e > 0. 
Then there are formulas 9}(x), 9f(x) for i — 1, . . . , n and ipi(y) for % — 1, . . . , n, 
all over M such that: 

(i) the formulas ?pi(y) partition y -space; 

(ii) for all i, if \= ipi(b), then \= 9}(x) — > <j>(x, b) — > 9f(x); 
(Hi) for each i, /jb(9f(x)) — fi(9j(x)) < e. 

Proof. Let b eW. Then by smoothness of \x and Lemma TTM, there are formulas 

9}(x), 9f(x) over M, such that 

(*) h 9}(x) <j>(x,b) 9*(x), and ^(x)) - p{0\{x)) < e. 

By compactness, there are finitely many such pairs, say, (9} (x) , 9f (x)) such 

that for every b one of these pairs satisfies (*). It is then easy to find the 

My)- □ 

Note that conversely, the existence of the formulas 9j(x) and ipi(y) with the 
above properties implies that the measure /j, is smooth. 



7.3. SMOOTH MEASURES 
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Proposition 7.9. Let fi G Wl x (M) be any measure. Then there is M ~< N 
and an extension fi' G DJl x (N) of fi which is smooth. 

Proof. Assume not. We build an increasing sequence (M a : a < \T\ + ) of 
models, with M = M and associated increasing sequence of measures fi a G 
DJl x (M a ). At limit stages, take the union. Assume M a , fi a are defined. By hy- 
pothesis, n a is not smooth, so we can find some a (x; b a ) G L(U) and fi 1 , fi 2 two 
extensions of fi a such that fi 2 (4> a (x; b a )) — /^ 1 (0 Q (x; b a )) = 4e a > 0. Then take 
M a+ i an extension of M a containing b a and set fi a+ \ = l/2(/i 1 + /i 2 ). Note that 
fi a +i has the following property: for any 9(x) G L(M a ), fi a+1 (9(x) A0(x; b a )) > 
e a . 

Having done the construction, we may assume that <j) a = <fr, and e a = e > 
are both constant. Let ji 1 be the measure U«<|t|+ /•*«■ Then we have 

fjf{<p{x] b a )A(j)(x] bp)) > e 

for any a < (5. This contradicts Lemma [7.61 □ 

We now show that smooth measures in NIP theories can be approximated 
by averages of points. 

If fiQ, . . . , /!„_i are measures and <j)(x; b) is a formula, then the notation 
Av(/i , . . . , [in-i; (j)(x; b)) stands for J J]fc<n b)). 

Similarly, if ao, . . . , a n _i are tuples, then Av(ao, . . . , a n -i] b)) stands for 
:\= 4>(ai] b)}\. We extend this notation naturally to the case where 4>(x; b) 
is replaced by an arbitrary Borel subset of some type space. 

Proposition 7.10. Let \i x be a global measure, smooth over M. Let X be 
a Borel subset of S X (M), and <j)(x;y) a formula. Fix e > 0. Then there are 
a , • • • , a n _i G U such that for any b eU, 

\n(X D <f>{x; b)) - Av(a , . . . , a n _i; X n <f>{x; b))\ < e. 

Proof. Fix formulas ipi(y), 9®(x) and 9]{x), i < m as given by Lemma 17.81 
Consider probability measure on the space S X (M). Then we can find 

types (pi : i < n) in S X (M) such that if A = -J2i< n Pi then \(9®(x) fl X), 
X(9l(x) nX) are all within e of the corresponding fi(9^(x) HX), fi(9}(x) DX). 

Now take points (oj : i < n) in U such that |= pi. Set A' = - J2i< n tp( a i/^)- 
Let b G U and let i < n be such that |= tpi(b). Then we have |= 9®{x) — > 
(f)(x; b) -»■ 9}{x) and n(9}(x)) - M(x)) < e. This implies that \(9}(x)f)X) - 
\{9?(x) n X) < 3e and therefore \'{(j){x; b) D X) is within 3e of A(0(x; b) n X) 
and thus within 4e of 6) fl A). □ 
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We conclude that we can approximate any measure by averages of types. 

Proposition 7.11. Let p G ^K X {A) be any Keisler measure, and let <p(x; y) G L 
be a formula and fix Xi, . . . , X m C S X (A) Borel subsets. Let e > 0. Then there 
are types po, ■ ■ ■ ,p n -i G S X (A) such that, for every b G A and every k < m: 

\p(<p(x\b) nX k ) - Av(p , . . . ,p n -r, (f)(x;b) nX k )\ < e. 

Furthermore, one can impose that p k G S(p) for all k. 

Proof. Let v x be a smooth extension of p over a model N D A. Let X C S X (A) 
be the support of p. The previous proposition works equally well with finitely 
many Borel subsets instead of one and gives points a , . . . , a n _x G N such that 
\p((j)(x; b) fl X k ) — Av(a , . . . , a n -i] (p( x ] b) D X k )\ < e for all b G N and k < m. 
Then set pi = tp(di/A). 

The "furthermore" part follows by taking S(fi) to be one of the X k s. □ 

Exercise 7.12. Give another proof of Proposition \7.11\ using the VC-theorem 
instead of smooth measures. Deduce that the number n of types can be chosen 
so as to depend only on VC-dim(<f)(x; y)) and e (see jUl Lemma 4.8]). 

7.4 Invariant measures 

We now extend a number of definitions from types to measures. 

Definition 7.13. Let ji G DJt x (U) be a measure, and let A dU. We say that 
p, is A-invariant if for every b =a b', and <fr(x;y) G L, we have fi(<p(x;b)) = 
fi((j)(x;b')). 

Definition 7.14. Let A C B and \x G VJl x (B). Then p does not fork (resp. 
divides) over A if p((j)(x; b)) = for every formula <j)(x; b) G L(B) which forks 
(resp. divides) over A. 

Proposition 7.15. Let p G 97t x (W) and A C U. Then u. does not fork over A 
if and only if it is Lstp (A) -invariant. 

Proof. Let p G VJt x (U) be Lstp(A)-invariant. As it is a global measure, it is 
enough to show that it does not divide over A. Let <f)(x; b) G L(M) be such 
that p{<f){x; b)) > 0. Let (bi : i < uj) be an A-indiscernible sequence with bo = b. 
Then by assumption bi)) = p(4>(x; b)) for all i < u. By Lemma 17751 the 
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partial type {4>(x; hi) : i < 00} is consistent. It follows that 4>(x; b) does not 
divide over A. 

Conversely, assume that fi does not fork over A. If Lstp(&A4) = Lstp(6'/A), 
then the formula <f>(x;b)A(f)(x;b') forks over A. Thus /i(</>(x; 6) A0(x; tV)) = 
0. □ 

Note in particular that, if A = M is a model, and /i does not fork over M, 
then we have n(<j)(x; b)A<f)(x; b')) = whenever b =m b'. 

Definition 7.16. Let M \= T and let fi x £ £0l(W): 

• fj, is finitely satisfiable in M if for every 6) £ with fj,((j)(x; b)) > 0, 
there is a £ M such that W |= 0(a; 6). 

• /x is definable over M if it is M-invariant and for every 4>{x\ y) £ L, and 
r £ [0, 1], the set {p £ S y (M) : ^{4>{x; b)) < r for any 6 £ W, b \= p} is an 
open subset of S y (M). 

• fi is B orel- definable over M if it is M-invariant and the above set is a 
Borel set of S y {M). 

Note that if fi is finitely satisfiable in M then it is M-invariant. Also \x is 
definable over M if and only if it is M-invariant and for any open (resp. closed) 
subset X of [0, 1], the set {p £ S y (M) : fi((f>(x; b)) £ X for any b £ U, b \= p} is 
an open (resp. closed) subset of [0, 1]. 

A measure /x is M-invariant (resp. finitely satisfiable in M), if and only if 
all types in S(fi) are M-invariant (resp. finitely satisfiable in M). 

Lemma 7.17. Let /j, £ M X {U) and M -<U. 

(i) If fi is smooth over M , then \x is finitely satisfiable and definable in M. 

(ii) If fi is M-invariant and smooth, then it is smooth over M. 

Proof, (i): Let (f>(x; b) be such that fj,(4>(x; b)) > 0. Then by lemma 171^1 taking 
e small enough, there is a formula 6°(x) £ L(M) such that |= 9°(x) — > <f)(x; b) 
and fi(8°(x)) > 0. In particular, 6°(x) is consistent, and therefore has a point 
in M. It follows that also (j)(x; b) has a point in M. 

If 4>{x\ y) and e > are fixed, then again apply Lemma 17.81 to obtain 
formulas ipt(y) £ L(M), i < n giving a partition of ?/-space. Let r £ [0, 1] be 
such that fi(4>{x; b)) < r. Let e > such that /i(0(x; b)) < r — 2e. Take ^>f(y) 
satisfied by b. Then for any b', \= ipl{y) implies that /i(0(x; b')) < r — e < r. 
This proves definability of /i. 
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(ii): Assume that \x is M-invariant and smooth. First note that, as in 
Lemma [2.211 \x is definable over M and finitely satisfiable in M. Fix a formula 
(j)(x;y) and e > 0. Lemma 17131 gives us formulas ipi(y; d), d) and 9}(x; d), 
i < n, where we have made the parameters explicit. By definability of /i, 
there is an M-formula ((z) satisfied by d and such that for any d' G ((U), the 
formulas tf)i(y;d'), 8®(x;d') and 9j(x;d'), i < n also satisfy the conclusion of 
the lemma. Then we can find such a d! in M. It follows that \x is smooth over 
M. □ 

Lemma 7.18. Let p G S{U) be some M-invariant type, then p is Borel- 
definable over M. 

Proof. Let y) be a formula. We need to see that the set {b G U : p h 
4>(x;b)} is Borel over M. Let b G U. Take a maximal N such that there is a 
sequence (a^ : i < N) |= p^ n '\m with 

(*) -i(0(oi; 6) O 0(a i+ i; 6)) for i < JV - 1. 

By the discussion on eventual types in Section [2.2.21 we know that 0(a/v-i; 6) 
holds if and only if p h <p(x; b). 

Let Aat(6) say that there are (a, : i < N) such that (*) holds and |= 
(j){ajs[-x,b). Similarly we say that B N {b) holds if and there is (a, : % < N) 
such that (*) holds and \= -i<^(ajv-i; b). Then both A N (x) and B N (x) are 
type-definable sets. We have p h 4>(x; b) if and only if there is some integer 
N < alt(0) + 1 such that A N (x) A ->B N+ i(x) holds. 

This shows that {b G U : p h 0(a;;6)} is a finite boolean combination of 
closed sets, definable over M. □ 

Proposition 7.19. Let fi G DJl(U) be M-invariant, then \i is Borel- definable 
over M . 

Proof. This follows easily from the previous lemma and Proposition 17.111 which 
says that a measure can be approximated by types in its support. □ 

We now have all we need to define the product of an invariant measure over 
another, similarly as we did for types. So let fi x G Wt(U) be M-invariant and 
let \ y G DJl(U) be any measure. We define u xy = fi x <E> as a global measure 
in two variables by the formula: 

u xy (<f>(x,y;b)) = f(p)d\ x \ N , 

Jp£Sy{N) 
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where / : p H- p{<fi{x , d; b)) for some (any) d |= p and AT is a small model 
containing Mb. Note that / is a measurable function by Borel-definability of 
p. We need to see that this definition does not depend on the choice of N. 
So let Ni C iV 2 be two models containing Mb and let /i, f 2 be the associated 
functions. Let ir : S(N 2 ) — > S(Ni) be the canonical restriction map. Then 
f 2 = 7r~ 1 (/ 1 ). For any clopen set B C A 7 !, A|jvi(.B) = A Itv^tt" 1 (£?)), hence this 
holds also when S is Borel and then by appromiating fx by step functions, we 
deduce that the inegral of fx and f 2 are the same. 

Note that if p = p and A = q are types, than we recover the usual product 
p x ® q y - 

If A is itself M-invariant, then p x <g> A^ is M-invariant. Also if both p and 
A are finitely satisfiable in M (resp. M-definable) , then p x £g> X y is finitely 
satisfiable in M (resp. M-definable). 

We now argue that the product of measures is associative. Let p x ,r] y 
and X z be three global measures, and assume that p and 77 are both M- 
invariant. Let <f)(x,y,z) G L(M), let a = {{p x <g) r] y ) (g) A z ) ((j)(x, y, z)) and 
b = (p x <S> (r) y <S> \ z )) (0(x, y,z)). We want to show that a = b. 

First, assume that p = p is a type. Let c?0(|/, z) denote the Borel subset 
of Sy tZ (M) defined by r G d<fr <^=>- p h <fr(x,b,c) where (b, c) |= r. Then 
b = r) y ® X z {d<j){y,z)) = J geSy(M) f(p)d\, where / : g -> i](d<f>(y,c)), for some 
c |= g. But this is exactly a. 

Now if /i is an arbitrary invariant measure, then given a formula </>(rr, y, z) 
and e, we can find a measure p, which is an average of types such that 
\p((f)(x, b, c)) — p((f>(x, b, c))| < e for all b,c. Let a, 6 be the corresponding a 
and 6 with p replaced by p. Then a and fe are respectively at distance e from 
a and b. By the previous paragraph, a — b, and we conclude that a = b. 

If p is an M-invariant global measure, then we can define by induction 
p^(x) = p(x) and p( n+1 \xo, . . . , x n ) = p(xo) ®p^ n '(xi, . . . , x n ). And of course 
p^ u '(xo, x±, . . . ) is the union of p( n \xo, . . . , x n _i) for n < tu. 

Similarly as for types, we prove that a finitely satisfiable measure and a 
definable one commute. 

Lemma 7.20. Let p x ,X y be two global M-invariant measures. Assume that 
p x <S> p y = p y <8> p x f or an U P ^ Sy{U) in the support of A. Then p and A 
commute. 

Proof. Let <p(x, y) be a formula, and write (p x <g>\ y )((f)(x, y)) = f peSy{M) f(p)d\ x 
as in the definition. Fix e > 0. Approximate that integral up to e by a finite sum 
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Ysi=i,...,nK x i) c ii where X { = {q E S y (M) : b)) E [n,ti], for some b \= 

q} for some r i; ij E [0, 1]. By Proposition I7.11[ we can find types q±, . . . , q m 
weakly random for A, such that if A denotes the average ^ Y^Qji then: 

- A(Xj) is within e of A(Xj), for all i < n; 

- \(4>(a, y)) is within e of A(0(a, y)) for all a ElA. 

The first condition ensures that fi x ®\ y {4>{x, y)) is within e of [i x (g>\ y ((j)(x, y)) 
and the second one ensures that \ y <g> /i x ((j)(x, y)) is within e oi \ y ® fi x ((j)(x, y)). 
As /i commutes with A, the result follows. □ 

Proposition 7.21. Let fi x E Wl(U) be definable and X y E Wl(U) be finitely 
satisfiable, then fi x <g> \ y = \ y (g) fi x . 

Proof. By the previous lemma, we may assume that X y = q y is a type. Let 
M such that both /i and q are M-invariant. Assume for a contradiction that 
there is r E [0,1] and e > such that \i x ® q y ((j)(x;y)) < r — e whereas 
q y g) fi x ((fi(x; y)) > r + e. By definability of fi, there is an M-formula i/)(y) such 
that q h ^(y) and for all b E ip(U), we have n(<j)(x] b)) < r. Pick pi, . . . ,p n E 
S(M) such that for every b E M, Av(pi, . . . ,p n ; <p(x; b)) is strictly within e of 
fj,(4>(x; b)) . Realize each p { by a { E U. By finite satisfiability of q, find some 
b E ij){M) such that |= (f)(ai; b) <^==^> q h b) for all i. 

By choice of p^'s and ^(y), we have Av(pi, . . . ,p n ; (j)(x; b)) > r. But that 
quantity is equal to ^ ^ q((j)(ai; y)), by choice of b. And therefore must be 
> r. Contradiction. □ 



7.5 Generically stable measures 

Similarly as we did for types, we define a generically stable measure as being 
a global measure which is both definable and finitely satisfiable (in some small 
model M). We will prove an analog of Theorem 12.331 First we define a new 
notion. 

Definition 7.22. Let fi x be a global M-invariant measure. We say that fi 
is fim (frequency interpretation measure) if for any formula <f>(x; y) E L and 
e > 0, there is a family (6 n (xi, . . . , x n ) : n < u) of formulas in L(M) such that: 

- lim^(6> n (xi,.. . ,x n )) = 1; 

- for any (ai,...,a n ) E 9 n (U), and any b ElA, Av(a l5 . . . , a n ; <f)(x; b)) is 
within e of /x(0(x; b)). 
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We will see later that a measure is fim if and only if it is generically stable. 
Left to right is easy, but the converse requires more work. 

The main result needed is an adaptation of the VC-theorem from measure 
theory to the context of Keisler measures. 

VC-type results 

We call a measure n(xi, . . . , x n ) over A symmetric if for any permutation a of 
{1, . . . , n) and any formula <j)(xi, . . . , x n ) G L(A), we have fi(<f)(xi, . . . , x n )) = 
{j,((f)(x a i, . . . , x an )). We extend naturally this definition to measures in infinitely 
many variables. 

In the following, we fix some definable set X over which variables x, Xi, . . . 
range. Let also y) be a formula. Let n be an integer, and write x = 
(xi, . . . , x n ) and x' = (x[, . . . , x' n ). We define 

S' n (x,x'; b) = | Av(xi, . . . ,x n ; b)) - Av(x[, ...,x' n , 4>(x; b))\ 

and 

S n (x, x') = sup S' n (x, x'; b). 

Note that for any e > the statement 5' n (x,x;b) < e is a definable property 
of (x,x';b). Similarly, the statement S n (x, x') < e is a definable property of 
(x, x'). 

Lemma 7.23. Let (f>(x;y) be a formula, and fix e > 0. For each n < u, 
let fi n (xi, . . . , x n , x[, . . . , x' n ) be a symmetric measure on X 2n over 0. Let E n 
denote the fi n - expectancy of 5 n . Then E n — > 1. 

Proof. The proof is essentially the same as that of the claim inside the proof 
of Theorem 16.41 As done there, we set Y = (2^)™ and consider its action on 
X 2n by flipping variables. We fix e > and define as in 16.41 the set A* C Y 
of bad permutations, which has measure at most 27r^,(2n) exp(— ne 2 /8), where 
7T0 is the shatter function of the family of sets {4>{U; b) : b ElA}. For a ^ A*, 
we have 8 n (a(x, x')) < e/2. For a G A* we have at any rate 5 n (a(x, x')) < 1. 
Thus for n big enough, for any x, x', 

2-^5 n (a(x,x'))<e. 

crGY 

By the symmetry of fi n , E n equals the //^-expectation of sup b 5 n (a(x, x')) for 
any a G Y, hence it is also equal to the expectation of the average. Therefore 
for n big enough, E n < e. □ 
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Corollary 7.24. Let fi(xx, • • • ) be a Keisler measure in u variables over some 
set A. Assume that fi\$ is symmetric. Let <p(x;y) G L and fix e > 0. Then 
there is n such that fj,(3y(5' n (xi, . . . , x n , x n+ i, . . . , X2n] y) > e)) < e- 

Proof. Apply Lemma 17.231 letting /i„(xi, . . . , X2 n ) be the restriction of \x to the 
variables (x\, . . . , X2 n )- □ 

Proposition 7.25. Let fi be a global M-invariant measure, and assume that 
l]S u '{x\, . . .)\® is symmetric. Then for any formula 4>(x;y) and e > 0, there 
are a\, . . . ,a n <EU such that for all b EU, Av(ai, . . . , a n ; <fi(x; b)) is within e of 
fi((j)(x;b)). 

Proof. Fix a formula <fi(x;y) and e > 0. Write a = (ai, . . . ,a n ) and a' = 
). Define 8 n (a,a') to say that for all b, Av(ai, . . . , a n ; <p(x; b)) is 
within e of Av(a' 1 , . . . , a' n ; (p(x; b)). For n big enough, we have the inequality 
l_i( 2n \9(xi, . . . , x 2n ) > 3/4. In particular, there is a G U, such that we have 
^ n \9{a,x 1 ,...,x n )) > 3/4. 

On the other hand, let ( n (xi, . . . ,x n ) say that Av(xi, . . . ,x n ;(f)(x;b)) is 
within e of fi{<p{x; b)). Then, since the variance of fi(4>(x;b)) is at most 1, 
Chebychev's inequality flC.2j) gives ^ n \( n (xi, . . . , x n )) < 1/ne 2 . Taking n 
large enough, we have ^ n \C, n {xi, . . . ,x n )) < 1/2. 

It follow that (j,( n '(0(a,Xi, . . . ,x n ) A ( n (xi, . . . , x n )) > 0. In particular 
Av(ai, . . . , a n ; (j)(x; b)) is within 2e of b)), as required. □ 

We can do the same incorporating a Borel set X. 

Proposition 7.26. Let fi be a global M-invariant measure, and assume that 
fi^(xi, . . • )|0 is symmetric. Then for any Borel set X , formula (p(x; y) ande > 
0, there are a±, . . . , a n G U such that for all b G U, Av(a 1; . . . , a n ; <fi(x\ b) n X) 
is within e of b) n X). 

Proof. We need to go through the proof of Lemma [7.231 and of Proposition l7.25l 
replacing everywhere 4>{x;b) by 4>{x;b) PI X. This poses no difficulty. □ 

Remark 7.27. The statements above imply that if ji is A-invariant and fi^ is 
symmetric, then jj, is equal to the average of any realization of im^>\a- More 
precisely, let X(x±, . . . ) be a global measure which extends fj,^ \a- Let 4>(x; b) G 
L{U) be a formula and let r = b)). Fix e > 0, then for all but finitely 

many values of n, we have X((f>(x n ] b)) G (r — e, r + e). 

This implies in particular that /i is determined by ^^\a- (This is also true 
for arbitrary invariant measures, but we do not have the tools to prove it). 
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Proof. Assume that we have X(<p(x n ;b)) — r > e for infinitely many values 
of n. Without loss, this is true for all n. Now set u = iv^>{x' x , x' 2 , . . . ) <8> 
X(x±,X2, • • • )• Then cu\$ is symmetric. By Corollary \7.24\ for n large enough, 
we have oj(5' n (xi, . . . , x n , x[, . . . , x' n ; b) < e/4) > 1/2. 

Let 9 n (xi, . . . , x n ) say Av(xi, . . . , x n ; </>(x; b)) — r > e/2. Then by the 
weak law numbers, we have X(6 n (xi, . . . , x n )) — > 1. Letting ^(x^, . . . , x' n ) say 
| Av(x' 1 , . . . , x' n ; (j){x\ b)) — r\ < e/4, then we also have ^ u >{6' n {x'^ . . . , x^)) — > 1. 
It follows that o;(^ n (xi, . . . ,x n ) A ^(x^, . . . ,x' n )) — > 1. But this contradicts 
what we established in the previous paragraph. □ 



Properties of generically stable measures 

Theorem 7.28. Let fi be a global M -invariant measure. Then the following 
are equivalent: 

(i) n is generically stable; 

(ii) n is fim; 

(Hi) /j,^(xi, . . . )\m is symmetric; 
(iv) we have \i x <S> ji y — \i y <S> /V 

Proof, (i) ^> (iv): Follows from Proposition 17.211 
(iv) ^> (iii): Clear by associativity of (8). 

(iii) =>■ (ii): Follows from Proposition 17.251 (and its proof). 

(ii) => (i): Easy. □ 

Proposition 7.29. Let fi x be a generically stable measure, then for any in- 
variant measure X y , we have \i x <8> X y = X y <E> \i x ■ 

Proof. By Lemma 17.201 we may assume that X y = q y is an invariant type. 
Let M be such that both fi and q are invariant over M and let N D M 
be |M| + -saturated. Take some b \= q\x in U. Let <p(x,y) e L(M) be a 
formula. Let X C S X (M) be the set of types p such that p h 4>(a, y) for some 
(any) a |= p. Then, by definition, we have \i x ® q y (4>(x,y)) = /i(<p(x,b)) and 
qy®lix(<j>fay))=n(X). 

Fix e > 0. By Proposition 17.261 let a\, . . . , a n e N be such that for all 
b' G U, Av(ai, . . . , a n ; <p(x; b')) is within e of fi(4>(x; b')) and Av(ai, . . . ,a n ;X) 
is within e of fi(X). Then /i x g) q y (<p(x, y)) is within e of Av(oi, ■ ■ ■ , a n] 4>( x j b)), 
which by definition of X is within e of q y <8> fi x (<f)(x, y)). 

As this holds for all e > 0, the result follows. □ 
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Proposition 7.30. Let fix be generically stable and A-invariant. Then \x is 
the unique A-invariant extension of h\a- 

Proof. Assume that X x is a global A-invariant extension of \i. As in the proof 
of Proposition 12.391 we show that fi \a = ^^\a- It follows that A is also 
generically stable, and by the remark after Proposition 17.261 fi = A. □ 

Example 7.31. Recall the examples of measures given in Example 17.21 We 
go through them again, and see that most of them are generically stable. 

• If p G S X (U) is a global type, then p is generically stable as a measure if 
and only if it is generically stable as a type. More generally, an average 
^2 i<LlJ CLiPi of types is generically stable if and only if all the p^s are 
generically stable (assuming of course > for all i). 

• Any Borel probability measure on 1R induces a smooth Keisler measure. 
See [12]. In fact, more generally, under mild assumptions, if M is a model 
equipped with a cr-algebra making externally definable sets measurable 
and no is a cx-additive measure on M, then /x induces a generically stable 
Keisler measure. See jHTJ, Theorem 5.1] for a precise statement and a 
proof. 

• The average measure of an indiscernible sequence I — (cij : i G [0, 1]) 
is always generically stable. It is easy for example to check fim. It is 
smooth if and only if the sequence / satisfies a property called distality. 
See [86]. 

• Let M = \\ v M n be an ultraproduct of finite structures. The measure \i 
on M defined as the ultraproduct of the counting measures is generically 
stable. This follows from the VC-theorem which implies fim for the limit 
measure \i. 

Historical Remarks 

Keisler measures were introduced in the context of NIP theories by Keisler in 
[55] . There, he defines a notion of smooth measure which is weaker than the 
one we consider here. It says in some sense that the measure has a unique 
extension to the unstable part of the structure. In [16], Hrushovski, Peterzil 
and Pillay revive those ideas and use measures to study groups definable in o- 
minimal theories; in particular, the concept of an invariant measure of a group 
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is central in their approach. The boundedness properties come from that work. 
Those ideas are developed further in [UJ , where in particular Borel definability 
is observed. Finally, generically stable measures are defined and studied in |42j . 

Itai Ben Yaacov proves in [97J that the randomization of an NIP theory 
is NIP. That statement is equivalent to the following: Let u(xi, . . . ) be a 
measure such that for any formula 8(xi, . . . ,x n ), and tuples i\ < ■ ■ ■ < i n , 
j i < ■ ■ ■ < j n , we have u(6(x il , . . . ,x in ) = u(9(x jl , . . . ,x jn )). Then for any 
formula <f)(x; b) and e, there can be only finitely many indices n for which 
\oj(4>(x n ; b)) — u((f)(x n+ i; b))\ > e. This result is thus the analog of Lemma [2751 
for measures and could be used to translate in a uniform way proofs from types 
to measures. We have not used it in order to keep this text self-contained. 
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Appendix A 

Examples of NIP structures 



A.l Linear orders and trees 
Linear orders 

By a colored order we mean a structure (M; <, C\ : % < a) where the Cj's are 
unary predicates and where < defines a linear order on M. 

Lemma A.l (Rubin). Let (M; <, Cj : % < a) be a colored order and let a G M. 
Let bi G M be a tuple of points all smaller than a and 6 2 G M a tuple of points 
greater than a, then tp(6i/a) Utp(6 2 /a) h tp(6i,6 2 /a). 

Proof. The proof is a straightforward back-and-forth. 

Assume M is cu-saturated and we are given two pairs of tuples (61,62) 
and (6^,6 2 ) satisfying the hypothesis and such that tp(6i/a) = tp(6' 1 /a) and 
tp(6 2 /a) = tp(6 2 /a). Notice that necessarily tp 9 ^(6i, 6 2 , a) = tp (? j(6' 1 , 6' 2 , a) 
(quantifier free types). Assume we are given 6 G M, without loss, 6 < a. We 
can find 6' G M such that tp(6V&/a) = tp(6' 1 "6'/a). Thus the back-and-forth 
goes through. □ 

Proposition A. 2. Any colored order (M;<,Cj : i < a) is dp-minimal (in 
particular NIP). 

Proof. Let a G M be a singleton and I = (di : i G Q) be an indiscernible 
sequence of n-tuples. For each index i write di = (a], . . . , a"). We say that / 
is cut by a if there is k < n and i, i' G Q such that < a < a*;. 

Assume that / is not cut by a, then it follows from Lemma IA.1I that for 
any i < i' G Q, tp(a/dfdi') U tp((a^ : i < j < i')/dfdi') h tp(a + (aj : z < 
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j < i')/aCai'). As (aj : % < j < i') is indiscernible over afa^, it follows that 
(a,j : % < j < i') is indiscernible over a, and therefore I is indiscernible over a. 

If now / and J are two mutually indiscernible sequences, then a singleton 
a cannot cut both / and J (otherwise either some point of / would cut J or 
some point in J would cut /). Therefore at least one of / and J is indiscernible 
over a. It follows that the theory is dp-minimal. □ 

Trees 

Definition A. 3 (Tree). A tree is a structure (M, <) such that < defines a 
partial order on M and for every a G M, the set {x G M : x < a} is linearly 
ordered by <. 

A tree M is connected if for every a,b G M the set {x G M : x < a Ax < b} 
is non-empty. 

We say that (M, <) is a meet-tree if it is a connected tree and in addition, 
for every two points a, b G M, the set {x £ M : x < a A x < b} has a greatest 
element, which we write a A b. 

Let (M, <) be a meet-tree and c G M is a point. The closed cone of center 
c is by definition the set C(c) := {x G M : x > c}. We can define on C(c) 
a relation E c by: xi? c y if x A y > c. One can easily check that this is an 
equivalence relation. We define an open cone of center c to be a equivalence 
class under the relation E c . 

A leaf of the tree M, is point in M which is maximal. 

Let Tdt be the theory of dense meet-trees in the language {<, A} defined 
by saying that: 

- < defines a meet-tree on the universe, and A is defined as above; 

- for any point x, {y : y < x} is dense with no first element; 

- for any point x, there are infinitely many open cones of center x. 
This is a complete theory which admits elimination of quantifiers. 

Lemma A. 4. Let (M, <) be a tree, a G M, and let C denote the closed cone 
of center a. Let x = (x\ . . . , x n ) G (M\C) n andy= (y\ . . . , y m ) G C m . Then 
tp(x/a) U tp(y/a) h tp{x U y/a). 

Proof. The proof is back-and-forth as for Lemma IA.U □ 



Proposition A. 5. Any tree (M, <) is dp-minimal (hence NIP). 
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The proof is similar to that of Proposition IA.21 but slightly longer as we 
have to various cases to consider. We omit it here and refer the reader to [88| 
Proposition 4.7]. 

O-minimal and C-minimal structures 

Let (M; <,...) be a structure on which < defines a linear order. We say that 
M is o-minimal if every definable set of M (in dimension 1) is a finite union 
of intervals (closed or open). One can prove that this implies that any model 
of Th(M) is o-minimal. 

As mentioned in the introduction, o-minimality is a major area of research 
within model theory, as it is both a powerful and versatile framework. This 
condition was isolated by van den Dries [90] and first systematically studied 
by Pillay and Steinhorn in [67]. The authors showed the fundamental cell 
decomposition theorem which states that definable subsets in any dimension 
can be decomposed as a finite union of cells (for examples, cells in dimension 
1 are intervals). Thus o-minimal theories are geometrically tame: pathologies 
such as space-filling curves do not exist in them. An important landmark in 
the subject is the proof by Wilkie that M e x P , the reals with a predicate for 
the exponential function, is o-minimal. Van den Dries [91] (and later Denef 
and van den Dries [27]) proved that expanding the reals by adding analytic 
functions restricted to compact sets also yields an o-minimal structure. 

Theorem A. 6. Any o-minimal theory is dp-minimal. 

Proof. Let (M; <,...) be an o-minimal structure. Let / = (bi : i G Q) be an 
indiscernible sequence of n-tuples and let <p(x; y) be a formula, where x is a 
single variable. Then by o-minimality, for every i G Q, (p(x; bi) is a finite union 
of intervals. Note that the end points of those intervals are definable over bi. By 
indiscernability of the sequence (bi : i G Q), the number of those intervals and 
their type (open, closed) is constant as i varies; so are the functions sending bi 
to each of the end-points of those intervals. In other words, there is a number 
N, 0-definable functions (gk{y) '■ k < N) and a quantifier free formula 8(x; z) in 
the language {=, <} such that for each i 6 Q, the formula 4>(x; bi) is equivalent 
to B(x] g (y), ... ,g N _x (y)). 

Let a G M be a singleton. If the sequence (tp(6j/a) : i G Q) is not 
constant, it follows from the previous analysis that a cuts the indiscernible 
sequence (dcl(6j) : % G Q) (where cut is defined as in the proof of IA.2[) . If 
I = (pi : % G Q) and J = (di : i G Q) are mutually indiscernible, then the 
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sequences (dcl(frj) : i G Q) and (dcl(dj) : i G Q) are mutually indiscernible. 
Therefore a singleton a cannot cut both. This shows that the theory is dp- 
minimal. □ 

For more information about o-minimal theories, we refer the reader to van 
den Dries' book [93] and Wilkie's survey |96j . 

The structure (M; <,...) is weakly o-minimal if in every elementary exten- 
sion of M, every definable unary set is a finite union of convex sets. Adapting 
the proof of Theorem IA.61 one can show that any weakly o-minimal structure 
is dp- minimal. 

We now present C-minimality, which is the analogue of o-minimality for 
trees instead of linear orders. 

Let (M; <) be a meet-tree, and let Bm be the set of maximal branches of 
M, i.e., the set of maximal totally ordered subsets of M. Note that if m G M 
is a leaf of M then the set {x G M : x < m} is an element of Bm- Hence the 
set of leaves of M naturally embeds into Bm- Given a G Bm and m G M, we 
write m < a if m is in the branch a. Then (M U {Bm}, <) is a meet-tree, the 
leaves of which are exactly the elements of Bm- 

If a, b, c G B M , we write C(a, b, c) if either b = c ^ a or a, b, c are distinct 
and a A c < b A c. Then the structure (B M , C) satisfies the following axioms: 

(CI) Var, y, z[C(x, y, z) C(x, z, y)j; 

(C2) Var, y, z[C(x, y, z) -»■ ->C(y, x, z)]; 

(C3) Vx, y, z, w[C(x, y, z) -»■ (C7(w, y, z) V C(x, w, z)]\ 

(C4) Vx, j/ [a; 7^ y -»■ 3z ^ yC(x, y, z)]. 

Conversely, one can show that given a structure (M; C) where C(x,y,z) 
satisfies axioms (CI) - (C4), then there is a meet tree (T; <) for which M is 
the set of branches of T and the C-structure on M coincides with the one 
defined above. Such a structure is called a C-structure. 

A structure (M; C, . . .) is C-minimal if any definable unary subset is equiv- 
alent to a boolean combination of formulas of the form C(x, a,b), a, b G M. A 
theory is C-minimal if all of its models are C-minimal. 

Theorem A. 7. Any C-minimal theory is dp-minimal, hence NIP. 

The proof is essentially the same as that of Theorem IA.6I using the fact 
that the formula C(x;yz) has VC-dimension 1. 

C-minimality was defined by Macpherson and Steinhorn in [51]; a cell de- 
composition result was shown by Haskell and Macpherson in [39] • Further 
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work has been done by Delon, Maalouf, Simonetta. The main example of a 
C-minimal structure is that of algebraically closed valued fields (see below). 

Both o-minimality and C-minimality are subsumed in the notion of VC- 
minimal theory as defined by Adler in [3]. See also [33] and |25j . 



Ordered abelian groups 

An ordered abelian group is a structure (T; 0, +, <) such that (T; 0, +) is an 
abelian group on which < defines a linear order with the following compatibility 
condition: 



A quantifier elimination result for ordered abelian groups is proved by 
Schmitt in [73]. A similar result is proved by Cluckers and Halupczok in [23]. 
Both languages are somewhat complicated, so we do not give the details here. 
We however mention two important special cases. 

1. Let Td oa g be the theory of (non-trivial) divisible ordered abelian group 
(i.e., we add to the theory of ordered abelian groups the axiom (3x)x ^ and 
for every integer n, the axiom (Vx)(Ek/)n ■ y = x. The ordering ensures that 
this y is unique.) Then Td oa g is a complete theory and admits elimination of 
quantifiers in the language {0, +, <}. 

2. Let T pres be Presburger arithmetic, namely the theory of (Z; 0, 1, +, <). 
That theory does not admit elimination of quantifiers, but it does if we add 
predicates {P n : n < u} defined by P n (x) -H- (^y)n ■ y = x. 

Using the general quantifier elimination result, Gurevich and Schmitt prove 
in [36] the following theorem. 

Theorem A. 8. Any ordered abelian group (T; 0, +, <) is NIP. 
Valued fields 

Since the seminal work of Ax-Kochen and Ersov, valued fields have played 
a important role in algebraic model theory. They also provide interesting 
examples of NIP structures. 



A. 2 Valued fields 
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We will recall briefly some facts about valuations, but we assume some 
familiarity with this notion in the proofs. The reader is referred to [29] for 
more details. 

Let T be an ordered abelian group. We let oo be an additional formal 
element and extend the ordering and composition law on TU {00} by declaring 
that 00 is greater than all elements of T and setting 7 + 00 = 00 + 7 = 00 + 00 = 
00 for all 7 G T. 

A valued field with value group T is a field K equipped with a surjective 
map v : K — T U {00} satisfying: 
v(x) = 00 -<=>- x = 0; 
v(x + y) > min(f (x), v (?/)); 
v(xy) = v(x) + v (y). 

We write K v to denote the field K equipped with the valuation v. 

The valuation ring O of K v is the ring {x £ K : v(x) > 0}. It is a local 
ring, i.e., has a unique maximal ideal ffll = {x G K : v(x) > 0}. The quotient 
k = O/DJl is called the residue field of K v . We let res : O — > k be the canonical 
projection (called the residue map). 

A valued field can be considered in various languages. The language L div 
is a one sorted language (K; 0,1,+,— ,-,|) containing the ring language on K 
and a binary predicate | interpreted as x\y <^=^ v(x) < v(y). 

One can also consider valued fields in the three sorted language L res having 
as sorts K and k equipped with their respective ring structures and Y equipped 
with its ordered abelian group structure. We also have two function symbols 
between sorts: v : K* — > T and res : O — > k interpreted as the valuation and 
residue map. (If the reader is bothered by the fact that those functions are 
not defined everywhere, she can extend them formally by setting v(0) = and 
res(x) = when v(x) < 0.) 

One can define on any valued field a natural C-structure by 
C(x, y, z) = v (2 — z) < v(y — z). 

ACVF 

Let ACVF denote the theory of algebraically closed (non-trivially) valued fields 
in the language Ldiv- The fact that the valued field K v is algebraically closed 
forces the residue field k to be also algebraically closed and the value group V 
to be a divisible group. 



A.2. VALUED FIELDS 



103 



We let ACVF( 0j o), ACVF( 0>P ) and ACVF( PiP ) denote the theories of alge- 
braically closed valued fields where the pair (characteristic of K v , characteristic 
of k) is respectively equal to (0,0), (0,p) and (p,p). 

Theorem A. 9. The theory ACVF eliminates quantifiers in the language Lim- 
its completion are the theories ACVFiq^, ACVFcq jV ) and ACVF( PjP ), for p a 
prime number. 

This was essentially proved by Robinson in [73]. See also [19]. The following 
easily follows from the theorem. 

Theorem A. 10. Equipped with its natural C-structure, the theory ACVF is 
C -minimal, in particular it is dp-minimal, hence NIP. 

Henselian valued fields 

An angular component is a map ac : K v — > k which satisfies: 

- ac(0) = 0; 

- the restriction of ac to K* has image in k* and is a morphism of multi- 
plicative groups; 

- for any x G K v of valuation 0, ac(cc) = res(x). 

If x,y G K, then v(x) < v(y) implies v(x + y) = v(x) and &c(x + y) = ac(x). 
If v(x) = v(y), then either ac(x) ^ — ac(y), in which case v(x + y) = v(x) and 
ac(x + y) = ac(x) + ac(y), or ac(x) = — ac(y) in which case v(x + y) > v(x) 
and we cannot say anything about ac(x + y). 

Fact A. 11. If K v is an saturated valued field, then K v admits an angular 
component. 

See for example [20]. Keeping this fact in mind, we may restrict our study to 
valued fields with an angular component. Let Lp as be the language L res U{&c} 
where ac is a new function symbol from K to k. 

Theorem A. 12 (Pas [65]). The (incomplete) theory of henselian valued field 
of residue characteristic eliminates fields quantifiers in the language Lp as . 

See also [19] for a proof. Examples of such fields include C ((£)), the 
fields of Laurent series respectively over R, or C, or more generally any field of 
the type k((T)) where k is a field of characteristic and T an ordered abelian 
group (see [291 Exercise 3.5.6]). 

A direct consequence is the celebrated result of Ax-Kochen and Ersov (in- 
dependently) on elementary equivalence of henselian valued fields. 



104 



APPENDIX A. EXAMPLES OF NIP STRUCTURES 



Theorem A. 13 (Ax-Kochen, Ersov). Let (K,v) and (L,w) be two henselian 
valued fields. Denote by Tk,Fl their respective value groups equipped with 
the ordered group structure and by k, 1 the residue fields in the ring language. 
Assume that k and 1 have characteristic zero. Then we have: 

K v = L w (Fx = I~l and k = 1). 

In the same spirit, if K v is independent, then this can be traced down to the 
residue field (recall that the value group, being a pure ordered abelian group, 
is always NIP). 

Theorem A. 14 (Delon [26]). A Henselian field of residue characteristic is 
NIP if and only if both its value group and its residue field are NIP. 

Knowing Theorem IA.8| we do not need to mention the value group. 

Corollary A. 15. A Henselian field of residue characteristic is NIP if and 
only if its residue field is NIP. 

Delon's proof uses a coheir-counting argument and requires to first under- 
stand types. We sketch here a more direct (though not necessarily simpler) 
argument using indiscernible sequences. As discussed previously, we may as- 
sume that our field K v is equipped with an angular component ac, and we 
work in the language Lp as (the reduct of an NIP theory is NIP, so the result 
will follow for the languages L^ v and L res .) 

Lemma A. 16. Let P(X) = Sfc< n afcX fc be in K V [X] and let (xi : i < u) be a 

sequence of elements of K v such that (v(xi))i <UJ is monotonous (increasing or 
decreasing), then there are r < n and t < to such that for all i > t and k ^ r , 
v(P(xi)) = v(a r xl) < w(afcxf) (hence also ac(P(xi)) = a.c(a r x^)). 

Proof. As the sequence (v(xi))i <w is increasing, there is t < u such that for 
i > t, v(xj) has the same relative position with respect to all values of the form 
v( " ak l7^ k '\ f° r k,k' G T (if T is not divisible, those elements live in its divisible 
hull, to which the order extends in a unique way). In particular, v(xj) is not 
equal to any of those values. 

Hence for % > t, the elements v (a^) + k ■ v(xi) for k < n are pairwise 
distinct, and there is r < n not depending on i such that their minimum is 
v(a r ) + r ■ v(xi). □ 
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We now classify indiscernible sequences of singletons depending on their 
type in the C-structure. Let (xi : i < u) be an indiscernible sequence of 
singletons of K v . 

Case 0: the sequence (v(xi) : % < u) is non-constant. Then by indiscern- 
ability, it is either decreasing or increasing. 

We now assume that the sequence (v(xi) : i < uj) is constant. For < i < uj, 
let yi = Xi — xq. Note that it is not possible for the sequence (v(yi) : < i < uj) 
to be increasing. For then, we would have for i > 1, v(xi — x\) = v((yi) — 
v(yi)) = v(yi). Thus the sequence v(xi — x\) would be constant, while the 
sequence v(xi — xq) is not and this contradicts indiscernibility. 

Case I: The sequence (v(yi) : < i < u) is decreasing. 

Case II: The sequence (v(yi) : < i < uj) is constant and the sequence 
(ac(?/j) : < i < oj) is not constant. 

Case III: The sequences (v(yi) : < i < uj) and (ac(yj) : < i < uj) are 
both constant. Then we have v(x2 — X\) = v(y2 — yi) > v(x2 — xq). Let x w be 
such that (xi : % < uj) is indiscernible. For i < uj, we let Then the 

sequence (v(zi) : i < uj) is decreasing. 

Lemma A. 17. Let (x^ : % < uj) be an indiscernible sequence of singletons of 
K v . Then there are: 

- an indiscernible sequence (oti : i < uj) of elements of V, 

- an indiscernible sequence (pi : i < uj) of elements of k, such that: 

for any P(X) G there is r < uj, and 7 G T such that v(P(xi)) = 

7 + r.aii for all i large enough. 

Also, there is Q G k[X] such that for all i large enough, &c(P(xi)) = Q(bi). 

Proof. If the sequence (xi)i <l0 falls in case 0, we are done by Lemma IA.16I 
Assume it falls in case I. There is a polynomial DP(Y) G lf«[y] such that 
we have P(xq + Y) = P(xq) + DP(Y). Then for all < % < uj, we have 
P(xi) = P(xq) + DP(yi). We conclude by applying Lemma fA.161 to the se- 
quence (yj)o<i«j and the polynomial P(x ) + DP(Y). 

If [xi) i<u falls in case III, we write similarly P(x w + Z) = P(xJ) + DPi(Z) 
and apply Lemma |A. 161 to the sequence (zi) i<UJ and the polynomial DPi(Z). 

Finally, assume that we are in case II. Write P(X) = Ylk<n ak -^ k ■ 
vq = v(xq). Let A G n be the set of k < n for which v(ak) + k • Vq is minimal. 
Let q{t) G k[t] be the polynomial J2keA ac(afc)t fc . Then for some < uj, 
for every i* < i < uj, ac(aj) is not a root of q(t). For such an i, we have 
v(P(xi)) = vq and ac(P(xj)) = q(&c(xi)). The lemma follows. □ 



106 



APPENDIX A. EXAMPLES OF NIP STRUCTURES 



We now prove Theorem IA.14I 

By Theorem IA. 121 (and keeping Proposition 12.141 in mind), it is enough to 
show that the following formulas are NIP: 

• (p(x,y) = 0, where x, y are variables of sort K v and is a quantifier free 
formula in the ring language of K v ; 

• ijj(x,i(y)) where x is a variable of sort T, y variables of sorts K v and T, 
ip is a formula in the language of T, and i is a tuple of terms with image in T; 

• 6(x, i(y)) where x is a variables of the sort k, y variables from K v and k, 
9 is a formula in the language of k, and i is a tuple of terms with image in k; 

• ip(v(Pi(x,yi)), . . . ,v(P n (x,yi)),y 2 ) where y 2 are variables from Y and if) 
is a formula in the language of T; 

• 6 l (ac(P 1 (x, y~i)), ■ ■ ■ , ac(P n (x , yi)) , y 2 ) , where y 2 are variables from k and 
9 is a formula in the language of k. 

In the first three cases, the results follows from the fact that algebraically 
closed fields, Th{T) and Th(k) respectively are NIP. 

Assume that the formula <p(x; y u y 2 ) = i[)(v(Pi(x, y x )), v(P n (x, y x )), y 2 ) 
has IP. Then there is an indiscernible sequence (xi : i < u) of singletons and 
parameters bi, b 2 such that (p(xf, bi, b 2 ) holds if and only if i is even. 

By Lemma IA.17[ there is an indiscernible sequence (a.i : i < u) of elements 
of T and for k < n, r n < u and 7„ 6 T such that v(Pk(xi, b~i)) = 7fc + Tk-Oti for 
all % large enough. Hence we can replace each v(Pi(x, y~i)) in the formula ip by 
a term in the language of T and we obtain a contradiction to the fact that the 
ordered abelian group T is NIP. 

We treat similarly the last case. This finishes the proof of Theorem I A. 141 

The theory ACVF has been intensely studied in the last ten years. Haskell, 
Hrushovski and Macpherson provided in [37] a description of imaginaries. They 
showed in [38] how types can be decomposed into an o-minimal component 
coming from the value group and a stable quotient, internal to the residue 
field. This property is referred to as metastability. A measure-theoretic analog 
of the Ax-Kochen principle is studied in [33] by Hrushovski and Kazhdan. Most 
recently, Hrushovski and Loeser [32] have given a model-theoretic construction 
of Berkovich spaces from rigid geometry. 

The p-adics 

Let Q p denote the usual field of p-adic numbers. The valued field Q p is 
henselian, but of residue characteristic p, hence the previous results do not 
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apply. Nonetheless, its theory is well understood as we explain now. 

First we give an axiomatization of Th(Q p ) in the language L res . In addition 
to the paper [10] by Ax and Kochen where the result first appeared, we refer 
the reader to Cherlin's [201 H Th. 40]. 

Theorem A. 18. Let T p be the following theory, in the language L res : 

- Axioms for henselian valued fields of characteristic zero. 

- The residue field k is isomorphic to ¥ p . 

- The value group T satisfies the theory of (Z; 0, +, <). 

- v(p) = 1 is the smallest positive element ofT. 

Then T p is a complete theory, and is equal to the theory ofQ p in the language 
L res . 

It is worth noting that in the field Q p , the valuation is definable in the 
pure field structure (namely, the valuation ring is definable by the formula 
<f)(x) = 3t,t 2 = 1 + p 3 y 4 ), hence we also obtain an axiomatization of Th(Q p ) 
as a pure field. 

The valued field Q p does not admit elimination of field quantifiers in either 
Ldi V or L res . To remedy that, we present two enriched languages. First, let 
Lmac be the language L<n v to which we add predicates {P n : < n < u}. The 
valued field Q p is made into an L mac -structure by interpreting P n as the set of 
n-th powers. 

Theorem A. 19 (Macintyre [58]). The field Q p has elimination of quantifiers 
in the language L mac . 

Another way to obtain quantifier elimination is by adding angular compo- 
nents as we did in the case of residue characteristic zero. However, we now 
need to add infinitely many. Let K v be a valued field of characteristic 0, whose 
residue field has characteristic p. A family (ac n : n < u) is a compatible family 
of angular components if: 

- ac n is a map K v — > O ' jp n O\ 

- ac n (0) = 0; the restriction of ac n to K* has image in {O /p n O)* and is a 
morphism of groups; 

- let 7r n denote the canonical projection 7r n : 0/p n+1 — > Ofp n O, then 
7r n o ac n _|_i = ac n . 

Let Lp aS)bJ be the language L rcs U {ac n : n < u}. 

We can explicitely construct a family of compatible angular components on 
Q p . Let x e <Q>* and let v = v(x). Set x = x/p v . Then x e O \ pO. Now let 
ac n (x) be the image of x in 0/p n O. 
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The properties of the sequence (ac n : < n < uj) are easy to check. 

Theorem A. 20 (Belair [H]). The canonical expansion of Q p in the language 
Lp as ,ui admits elimination of quantifiers. 

Now one can adapt the proof of the previous subsection to show that Q p is 
NIP (or alternatively adapt Delon's proof, which is what Belair does in [Ti]). 
Actually, more is true. 

Theorem A. 21 (Dolich, Goodrick, Lippel [28]). For any prime p, the field Q p 
of p-adics in the language Ldi V is dp-minimal. 



Appendix B 
Stability 



Stability is the first, and most important, dividing line discovered by Shelah in 
his work on classification theory. Stable theories have a canonical notion of in- 
dependence (non-forking) from which a dimension theory and strong structure 
results follow. It has proved to be a fundamental and extremely rich property, 
despite the fact that stable structures are rare. In algebra, the main examples 
are algebraically closed fields, separably closed fields and differentially closed 
fields. 

We only record here basic facts about stability which are used in the text. 
The reader can consult (66] or [72] for more information. 

Definition B.l (Stable formula). A formula 0(x; y) is stable if there does not 
exist points (a^ : % < uS) such that: 



If a formula <p(x; y) is unstable, then by Ramsey, we can assume that the 
sequence (a^ : i < ui) witnessing instability is indiscernible. We easily obtain 
the following equivalent characterization. 

Lemma B.2. A formula (f)(x;y) is stable if and only if there is no indiscernible 
sequence (aj : i < u2) and b G W such that: 



There are a number of equivalent characterizations of stability. 
If A C U and p G S^(A) is a 0-type over A, we say that p is definable if 
there is a formula d<p(y; z) and a tuple b G A such that for all a G A we have 
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Proposition B.3. Let<p(x;y) be a formula. Then the following are equivalent: 

(i) 4>{x] y) is stable; 

(ii) for all A infinite, we have \S<j,(A)\ < \A\; 

(Hi) there is an integer k such that for all A, finite or infinite, we have 
\S,(A)\<\A\ k ; 

(iv) for all A, all (p-types over A are definable. 

Note that easily (iv) =>- (iii) =>- (ii) =>- (i). The implication (i) =^> (iv) is 
more complicated, see [661 Lemma 2.2]. Moreover, if A = M is a model, then 
every <j)-type over M can be defined by a 0-formula, i.e., dtp can be taken to 
be a boolean combination of formulas of the form <fr(x; b) for b e M . 



Appendix C 
Probability theory 



We recall here some basic results of probability theory that we need in this 
text. We refer the reader to any introductory book on the subject for more 
details. Alon and Spencer's book [5], although not a textbook on probability 
theory, is a nice reference. 

A probability space (X, B, /x) is a set X equipped with a cr-algebra B and a 
ex-additive measure /i on B such that n{A) = 1. For each integer k, the cartesian 
power X k is naturally equipped with the product a-algebra B® k which is the 
a-algebra generated by sets of the form B\ x • • • x Bk for B±, . . . , G B. The 
product measure is defined as the unique probability measure on (X k , B® k ) 
such that fi k (B 1 x • • • x B k ) = fi(Bi) ■ ■ ■ fi(B k ). 

If A C X is a measurable subset of X, we let 1a be its characteristic 
function. We write Prob(A) = fi(A). If /, g : X — > R are measurable functions, 
we will write {/ > g} for {x G X : f(x) > g(x)} and Prob(/ > g) instead of 
Prob({a; G X : f(x) > g(x)}). 

A measurable function / : X — > R is called a random variable. Its ex- 
pectancy E(/) is defined as J f(x)d/i. Its variance is defined as Var(/) = 
E((/-E(/)) 2 ). 

Two random variables /, g : X — y R are uncorrelated if E(/p) = E(/) E(g). 
If /, (7 are uncorrelated random variables and a, b G R, then we have Var(a ■ 
f + b-g) = a 2 Var(/) + 6 2 Var(#). 

Random variables /i, . . . , f n : X — >■ R are said to be independent if for any 
measurable sets Bi, . . . ,B n of R, we have Prob(P|{/fc G = n P r °b(/fc G 
In particular, two independent variables are uncorrelated. 
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Lemma C.l (First moment). Let f : X — > IR + be a random variable. Then 
for all r > 0, we have 

Prob(/ > r) < 



□ 



be measurable. 



Proof. Write that E(/) < E(/.l {/ > r} ) < rProb(/ > r). 

Proposition C.2 (Chebyshev's inequality). Let f : X — > 

Then we have 

Prob(|/-E(/)|>e)<^Q. 



Proof. Apply the first moment inequality to the random variable (/ — E(/)) 2 
and r = e 2 . □ 

Proposition C.3 (Weak law of large numbers). Let A C X be a measurable 
subset of X and fix e > 0, then 



lim // 



n 

n ^ 

i=l 



> e 



0. 



Proof. Fix some integer n. For i < n, the random variable l{ Xig A} : AT n — > R 
has expectancy equal to and variance [a(A)(1 — fi{A)) < 1. Also for 

i ^ j, the random variables l{ Xi€j 4} and l^gA} are uncorrelated (in fact, 
they are independent). It follows that the random variable - XX=i IfoeA} has 
expectancy and variance < 1/n. 

By Chebychev's inequality, we deduce that 



Prob 



(|^Ei ( ^}-^)|> £ )<^, 



from which the result follows. 



□ 



Proposition C.4 (Chernoff's bound, special case). Let fi, ■ ■ ■ , f n be indepen- 
dent random variables, such that Prob(/fc = 1) = Prob(/fc = ~ 1) = 1/2, for 
all k. Let e > 0, then letting g — -^2 fk> we have 



Prob ( | p | > e) < 2exp 
For a proof, see [HI Appendix A] 
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